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Abstract 

The triangle-free process begins with an empty graph on n vertices and iteratively adds edges 
chosen uniformly at random subject to the constraint that no triangle is formed. We determine the 
asymptotic number of edges in the maximal triangle-free graph at which the triangle-free process 
terminates. We also bound the independence number of this graph, which gives an improved 
lower bound on the Ramsey numbers i?(3,f): we show R{3,t) > (1/4 — o(l))t^/ log t, which is 
within a 4 + o(l) factor of the best known upper bound. Our improvement on previous analyses 
of this process exploits the self-correcting nature of key statistics of the process. Furthermore, we 
determine which bounded size subgraphs are likely to appear in the maximal triangle-free graph 
produced by the triangle-free process: they are precisely those triangle-free graphs with 2-density 
at most 2. 

1 Introduction 

Constrained random graph processes provide both an interesting class of random graphs models 
and a natural source for constructions in graph theory. Although the dependencies introduced by 
the constraints make such processes difficult to analyse, the evidence to date suggests that they are 
particularly useful for producing graphs of interest for certain extremal problems. Here we consider 
the triangle-free random graph process, which is defined by sequentially adding edges, starting with 
the empty graph, chosen uniformly at random subject to the constraint that no triangle is formed. 
Formally, let G(0) be the empty graph on n vertices. At stage i we have a graph G{i); we denote 
its edge set by E{i), and let 0{i) be the set of pairs xy that are open, in that G{i) U {xy} has no 
triangle. We obtain G{i + 1) from G{i) by adding a uniformly random pair from 0{i). 

This process was introduced by Bollobas and Erdos (see [9j), and first analysed by Erdos, Suen 
and Winkler {12j , using a differential equations method introduced by Rucinski and Wormald |21j for 
the analysis of the constrained graph process known as the 'd- process'. One motivation for their work 
was that their analysis of the triangle- free process led to the best lower bound on the Ramsey number 
R{3, t) known at that time. The Ramsey number R{s, t) is the least number n such that any graph on 
n vertices contains a complete graph with s vertices or an independent set with t vertices. In general, 
very little is known about these numbers, even approximately. The upper bound R{3, t) = 0{t^ / log t) 
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was obtained by Ajtai, Komlos and Szemeredi jTj, but for many years the best known lower bound, 
due to Erdos [llj, was Q{t'^/log^t). The order of magnitude was finally determined by Kim [15j, who 
showed that R{3,t) = Q{t'^ / logt). He employed a semi-random construction that is loosely related 
to the triangle-free process, thus leaving open the question of whether the triangle-free process itself 
achieves this bound; this was conjectured by Spencer [23j and proved by Bohman [5]. There is now 
a large literature on the general H-hee process, obtained by replacing 'triangle' by any fixed graph 
H in the definition; seelHllIOllITllISllIHlISOllMlllZlISHllMlEa However, the theory is stiU very 
much in its early stages: we conjectured that our lower bound for H strictly 2-balanced, given in 
[8], gives the correct order of magnitude for the length of the process, but so far this has only been 
proved for some special graphs. 

In this paper we specialise to the triangle- free process, where we can now give an asymptotically 
optimal analysis. Our improvement on previous analyses of this process exploits the self-correcting 
nature of key statistics of the process. For a treatment of self-correction in a simpler context see f6]. 
The methods that we use to establish self-correction of the triangle-free process build on the ideas 
used recently by Bohman, Frieze and Lubetzky |7] for an analysis of the triangle-removal process. 
Furthermore, the results of this paper have also been obtained independently and simultaneously by 
Fiz Pontiveros, Griffiths and Morris; their proof also exploits self-correction, but is different to ours 
in some important ways. 

Let G be the maximal triangle-free graph at which the triangle-free process terminates. 



Theorem 1.1. With high probability, every vertex of G has degree (1 + o(l)) y ^nlogn. Thus the 
number of edges in G is (^^^ + o(l)^ (log n)^/^n^/^ with high probability. 

We also obtain the following bound on the size of any independent set in G. 

Theorem 1.2. With high probability, G has independence number at most (1 + o{l))y/2nlogn. 

An immediate consequence is the following new lower bound on Ramsey numbers. The best known 
upper bound is R{3,t) < (1 + o{l))t'^ / logt, due to Shearer [22] . 



Theorem 1.3. R{3,t) > (l - o{l)) t^ / logt. 

These results are predicted by a simple heuristic. The graph G{i) that we get after i steps of the 
triangle-free process should closely resemble the Erdos-Renyi random graph Gn,p with i = n?p/2, 
with the exception that Gn,p should have many triangles while G{i) has none. 



In addition to Theorems 1.1 and 1.2 we show that this heuristic extends to all small subgraph 
counts; in particular, we answer the question of which subgraphs appear in G. Suppose iJ is a graph 
with at least 3 vertices. The 2-density of H is d2{H) = The maximum 2-density m2{H) of 

H is the maximum of d2{H') over subgraphs H' of H with at least 3 vertices. 

Theorem 1.4. Let H be a triangle-free graph with at least 3 vertices. 

(i) Ifm2{H) < 2 then F{H C G) = 1 - o(l). 

(ii) Ifm2{H) > 2 then F{H C G) = o(l). 
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Thus, the small subgraphs that are likely to appear in G are exactly the same as the triangle- free 
subgraphs that appear in the corresponding Gn,p- 

Note that the lower bound on -R(3, t) given by the triangle-free process is non-constructive; for 
an explicit construction of a triangle-free graph on 0(i^/^) vertices with independence number less 
than t see Alon [2]. Alon, Ben-Shimon and Krivelevich [3] gave a construction that can be applied 
to G to produce a regular Ramsey R{2>, t) graph, at the cost of a worse constant in the lower bound 
on R{3,t). 

The bulk of this paper is occupied with the analysis required for the lower bound in Theorem 
To prove this, we in fact prove much more generally that we can 'track' several ensembles of 
'extension variables' for most of the process; this is formalised as Theorem 2.2 The proof of Theorem 



2.2 is outlined in the next section, then implemented over the four following sections. In Section 3 



we present some coupling and union bound estimates that are needed throughout the paper, and 
also prove Theorem 1.4, assuming Theorem 2.2 In Sections 4, 5 and 6, we prove Theorem 2.2 via a 



self-correcting analysis of three ensembles of random variables. Section 7 is mostly occupied by the 
proof of Theorem 1.2 it also contains the proof of the upper bound in Theorem |1.1[ which is similar 
and easier. We conclude with some brief remarks in Section 8. 



2 Overview of lower bound 

In this section we outline the proof of the lower bound in Theorem |1.1[ We are guided throughout 
by the heuristic that G{i) should resemble Gn,p with i = n?p/2. Before proceeding with the outline 
of the proof we mention a consequence of this heuristic that is central to the entire argument. We 
introduce a continuous time that scales as 

t = in-^/^ 

Note that p = 2tn^^^'^. We define Q{i) to be the number of open ordered pairs in G{i). (So 
Q{i) = 2|0(i)|.) This variable is crucial to our understanding of the process. We have Q{0) = n? — n, 
and the process ends exactly when Q{i) = 0. How do we expect Q{i) to evolve? If G{i) resembles 
Gn,p then for any pair uv we should have 

Fiuv G 0{i)) « (1 -/)""^ « e-"P' = e-^'\ 

We set q{t) = e~^*^n^ and expect to have 

Q{i) « q{t) 

for most of the evolution of the process. This is exactly what we prove. 
2.1 Strategy 

We use dynamic concentration inequalities for a carefully chosen ensemble of random variables as- 
sociated with the process. We aim to show V{i) ~ v{t) for all variables V in the ensemble, for some 
smooth function v{t), which we refer to as the scaling of V. Here V{i) denotes the value of V after i 
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steps of the process, and we scale time as t = in^^^"^. For each V we define a tracking variable TVii) 
and aim to show that 'DV{i) = V{i) — TV{i) satisfies < ev{t)v{t), for some error functions 

ey(t). We use TV{i) rather than v{t) so that we can isolate variations in V from variations in other 
variables that have an impact on V. 

The improvement to earlier analysis of the process comes from 'self-correction', i.e. the mean- 
reverting properties of the system of variables. We take ev{t) = fvit) + '^9v{t)i where we think 
of fvit) as the 'main error term' and gvit) as the 'martingale deviation term'. We usually have 
gv ^ fvi but there are some exceptions when t is small and hence fvit) is too small. We require 
gvit)vit) to be 'approximately non-increasing' in t, in that gvit')vit') = Oigvit)vit)) for all t' > 
We define the critical window 

Wvii) = [ifvit) + gvit))vit), ifvit) + 2gvit))vit)]. 

We aim to prove the trend hypothesis: ZVii) := \T>Vii)\ — evit)vit) is a supermartingale when 
E The trend hypothesis will follow from the variation equation for ey(t), which 

balances the changes in T>Vii) and ey(t)v(i). Since errors can transfer from one variable to another, 
each variation equation is a differential inequality that can involve many of the error functions. 
We aim to track the process up to the time 

tmax = 2\/(l/2-e)logn. 

More precisely, we will define a stopping time / such that if I > z then every 'good' variable in our 
ensemble satisfies the required estimates at step i (the distinction between 'good' and 'bad' variables 
is discussed in Section 2.4). Setting imax = tmax'n^^'^ , it will suffice to show that / > imax with high 
probability. If / < imax then it will follow that there exists i* = I < imax and a 'good' variable 
V such that DVii*) is too large. In this situation DVii) enters Wvii') from below at some step 
i' < i*, stays in Wvii) for i' < i < i* then goes above Wvii*) at step i*. During this time ZVii) is 
a supermartingale, with ZVii') < —gvit')vit') and ZVii*) > 0, so we have an increase of at least 
gvit')vit') against the drift of the supermartingale. Then we use Freedman's martingale inequality 
|13J, which is as follows. 

Lemma 2.1 (Freedman). Suppose (X(i))j>o is a supermartingale with respect to the filtration T = 
i^i)i>o- Suppose that X(i + 1) — Xii) < B for all i and define = Yll=i VariXii) \ Fi-i). Then 
for any a,v > we have 

P i3i such that Xii) > X(0) + a and Vii) < v) < exp 

To apply Freedman's inequality, we estimate 

Varvit) = VariZVii) \ Fi-i) and iVy(t) = \ZVii + 1) - ZVii)\. 

^There will be one exceptional type of variable, the vertex degrees, for which this does not hold. 

^We generally work with the 'upper critical window', i.e. we consider |DV(i)| — Wij). The situation where the 
difference variable is negative can be treated in exactly the same way with reversed signs. We also remark that we 
need to 'freeze' ZV{i) if V becomes 'bad', as explained below. 



2iv + Ba) 
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Since gy{t)v{t) is approximately non-increasing (unless y is a vertex degree variable), it suffices to 
have 

gv{tfv{tf>Varv{t){n\ognf/'^ and gv{t)v{t) > Nv{t) log n 

to get the required bound, with subpolynomial failure probability. We refer to this as the boundedness 
hypothesis. 



The lower bound of Theorem 1.1 follows from an application of the union bound to the union 
of these events over all variables we consider and all points i' when such a variable might enter the 
critical interval. Thus, after establishing the trend and boundedness hypotheses for all variables, we 
may conclude that < ev{t)v{t) for all t < tmax with high probability. In other words, we 

can conclude / > imax with high probability. 



2.2 Variables 

All definitions are with respect to the graph G{i). Sometimes we use a variable name to also denote 
the set that it counts, e.g. Q{i) is the number of ordered open pairs, and also denotes the set of ordered 
open pairs. We usually omit {i) and {€) from our notation, e.g. Q means Q{i) and q means q{t). 
We use capital letters for variable names and the corresponding lower case letter for the scaling. We 
express scalings using the (approximate) edge density and open pair density; these are respectively 

p = 2m~^ = 2tn"^/2 and g = e~^*^ 

The next most important variable in our analysis, after the variable Q defined above, is the 
variable Y^v which, for a fixed pair of vertices uv, is the number of vertices w such that uw is an 
open pair and vw is an edge. It is natural that Yuv should play an important role in this analysis, 
as when the pair uv is added as an edge, the number of open edges that become closed is exactly 
Yuv + Yvu- The motivation for introducing the ensembles of variables defined below is as follows: 
control of the global variables is needed to get good control of Q, control of the stacking variables 
is needed to get good control of Y^v, and controllable variables play a crucial role in our analysis of 
the stacking variables. 

We note that the proof of the fact that we can track the controllable variables (up to the precision 
needed for our purposes) is relatively short. In a certain sense, our results on controllable variables 
can be viewed as a triangle-free process analog of the concentration on subgraph extensions that 
follows from Kim-Vu polynomial concentration |16j . (We remark in passing that a similar analog 
should hold for the triangle removal process, and the introduction of this idea would simplify the 
analysis of the triangle removal process recently given by Bohman, Frieze and Lubetzky [7].) 



2.2.1 Global variables 

We begin with the variable that we are most interested in understanding: the number of open pairs. 
We also include two other variables that will allow us to maintain precise control on the number of 
open pairs. 

• Q = 2|0(i)| is the number of ordered open pairs. The scaling is q = qv? . 
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• Ris the number of ordered triples with 3 open pairs. The scahng is r = q^n^. 

• S is the number of ordered triples abc where ab is an edge and ac, be are open pairs. The 
scaling is s = pq^n^ = Iti^v^l'^ . 

We refer to Q, R and S as global variables. 
2.2.2 Stacking variables 

In order to understand the evolution of Q, R and S, we introduce a large collection of stacking 
variables. The basic building blocks for the stacking variables are the following one-vertex extensions, 
which are defined for every ordered pair uv. 

• Xuv is the number of vertices w such that uw and vw axe open pairs. The scaling is a:: = q^n. 

• Yuv is the number of vertices w such that uw is an open pair and vw is an edge. The scaling 
is y = 2tqn}-/'^. 

We also need information about the degree and the open degree of a vertex. 

• Xu is the open degree of u, defined as the number of vertices oj such that uoj is open. The 
scaling is x\ = nq. 

• Yu is the degree of u, defined as the number of vertices u such that vio is an edge. The scaling 
is ui = 2tn^l'^. 

We refer to Y^^ and X^^ as codegree variables, to 1^ as a degree variable, and to X^ as an open degree 

variable. 

A stacking variable is an iterated extension variable, in which each iteration adds an extension 
from the collection Xy,Yy, X^y, Y^y, Yy^, where y is the vertex added in the previous extension, and 
xy is an open pair in the previous extension. We will only track a subset of the collection of the 
stacking variables, the M-bounded stacking variables, which are defined below. Let S be the set of 
sequences 

TT G {0,E,yO,X°} X {0,E,Y^,Y^,X^,X^y 

such that if E occurs then it only does so as the last symbol of vr. For any vr € 5 and pair of vertices 
uv (such that uv E{i)) we define 5,^^, according to the following rules. Each clement of vr indicates 
the next extension to be added and specifics an open pair in this extension to be the next rung in 
the stacking variable. We initiate by letting uv be the active rung and letting v be the last vertex. 
Suppose we have constructed i — \ steps of our stacking variable and that we have an active rung 
xy with last vertex y. If 7r(z) = O then the next step is an Xy extension with the single open pair in 
this extension the next rung. If 7r(i) = E then the next step is an Yy extension and then there is no 
active rung: the variable terminates here. 

Now suppose 7r(i) ^ {0,E'}; that is, suppose Ti{i) indicates an X ox Y extension on the active 
rung. The superscript indicates the direction of this extension. For Y it determines whether we add 
Yxy or Yyx, and the new open pair becomes the active rung. For X it determines which of the two 
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Figure 1: The stacking variable corresponding to tt = Y^X^X^Y'^OY'^X^OOY^X^OE. Thick 
hnes represent edges and thin hnes represent open pairs. 

new open pairs becomes the active rung. In both cases, a superscript of O (for 'outer') indicates that 
the new active rung is incident with the last vertex, y, while a superscript of I (for 'inner') indicates 
that the next active rung is not incident with y (i.e. it is incident with x). We say that an open pair 
of TT that is not a rung is a stringer open pair. Note that each occurrence of X in tt gives one stringer 
open pair. Edges of tt are called stringer edges. Note that any rung is a cutset of tt. 

The simplest stacking variables are those of length 1, namely = X^y, = S^^ = Xy 
and S^y = Yy. The last example illustrates the general phenomenon that when 7r(l) G {0,E} we 
obtain an extension based at the single vertex v, which does not depend on u. While we could denote 
this variable more simply by Sy, it is convenient to have a unified notation for stacking variables 
that allows the base of the extension to have one or two vertices. 

Formally, we can view S^y as counting the number of injections ip : {a^, ay, ai, . . . , a^-,^^} [n] 
such that ip{au) = u, ip{ay) = v and ip{ai) is a vertex that plays the role in the extension defined 
by 7r(z) for i = l,...,|7r|. Fix two rungs axOiy and aaCKft, where x < y < a < b and none of 
7r(y + l),...7r(6) are O or E. Let TT[xy,ab] be the portion of S^y induced by the set of vertices 
{ox, ■ ■ . We call this structure the triangular ladder of vr cut off by xy and ah. Note that any 
stacking variable is a concatenation of some number of triangular ladders and paths of open pairs, 
possibly ending with a pendant edge. 

The concept of direction may be clarified by the following pictorial representation. We visualise 
Tr[xy, ab] as a horizontal strip of two rows, 'top' and 'bottom', where x and y are the leftmost elements 
of the two rows, and a and b are the rightmost two elements of the two rows (in some order). The 
remaining vertices are assigned so that rungs have one vertex in each row, whereas stringers lie 
within rows (this uniquely defines the assignment). Our convention for superscripts in X and Y 
corresponds to walking between the rows and describing whether the new vertex is added on the 
same row or the opposite row; thus / indicates that the new vertex is added to the same row, and O 
that it is added to the other row. Conversely, any such drawing determines a unique order vi, . . . ,vt 
of vertices, which we call the stacking order, from which we can reconstruct vr. Figure 1 illustrates 
a stacking variable corresponding to vr = Y'-^X'^X'^Y'^OY'^X^OOY^ X'-^OE (thick lines represent 
edges and thin lines represent open pairs). 

To define the M-bounded stacking variables that play a key role in the proof, we need some 
additional terminology. Consider a triangular ladder Tr[ab,xy]. We say that ai is a turning point in 
this ladder if the superscript of Tr{i + 1) is O. Note that if ai is a turning point then it is in at least 
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two rungs. The open pairs containing are a^-Oi and ajai for i + 1 < j < i^, for some i~ and 
i"*", which are respectively the previous and next turning points (or non-existent if there are no such 
turning points). If aj is in the top row (for example) then a^- and aj, i + 1 < j < i'^ are consecutive 
along the bottom row. 

We define three weights associated with tt G 5 as follows. The 1-weight wi{Tr) is the number of 
positions in tt that contain the symbols O or E. The 2-weight W2{tt) is the number of positions in tt 
that contain the symbols X'-^ or Y*^ . The weight w{tt) is wi{'k) + 'W2{'k)- An M-fan in tt is a sequence 
of M + 1 consecutive positions of tt whose entries are in the set 

{x°,yO} X {x^y^}^-i X {x^,Y^,xO}. 

Note that an Af-fan corresponds to a sequence of M + 1 vertices in the same row of a triangular 
ladder (flipping the position of the last element of the sequence if the last element is X^). We say 
that a stacking variable tt is M -hounded if it satisfies the following conditions. (Note that the first 
of these is a condition on any tt e S which we copy here for ease of reference.) 

(i) If E occurs in tt then it is the last symbol, 

(ii) The sequences OY^ and OX^ do not appear in tt in any pair of positions other than the last 
two positions, 

(iii) w{7t) < 2M, 

(iv) If w{Tr) = 2M then the last symbol in tt is O, -B, X^ or Y'^, and 

(v) TT does not contain an M-fan. 

We let Sm be the set of M-bounded stacking variables. Note that such a variable has length at most 
2M2 -M + 2< 2M2. 

2.2.3 Controllable variables 

Finally, we formulate a very general condition under which wc have some control on a variable. 
Suppose r is a graph, J is a spanning subgraph of F and A C Vp- We refer to {A, J.T) as an 
extension. Suppose that (j) : A ^ [n] is an injective mapping. We define the extension variables 
X(t>,J,r{i) to be the number of injective maps / : — ^ [n] such that 

(i) / restricts to on A, 

(ii) /(e) G E{i) for every e e Ej not contained in A, and 

(iii) /(e) G 0{i) for every e & Er\ Ej not contained in A. 

We call (J, F) the underlying graph pair of X^,j,r- We introduce the abbreviations V = X<^,j,r, 
n{V) = \Vt\-\A\, e{V) = ej - ejiA], and o{V) = {er - ej) - {er[A] - ej[A])- 
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The scaling is v = xa,j,t = n"^^^p^^^^q°^^\ We expect V ^ v, provided there is no subextension 
that is 'sparse', in that it has scaling much smaller than 1. Given A Q B <^ B' <^ Vr we write 

gB' ^ Sb'{J,T) = nl^'l~l'^l/j[s'l"''-'M^(er[s']-ej[B'])-(er[B]-ej[B])_ 

For example, S^^ = v. Note that ii A C B C B' C B" C Vr we have S§" = S%' . Note also that 
the letter 'S' is used for scalings and stacking variables, but there is no possibility for confusion, as 
the use is determined by the form of the superscript. 

Let t' >!. We say that V is controllable at time t' if J 7^ F (i.e. at least one pair is open) and 
for 1 < t < f and ^ C S C l/p we have 

where 5 > is a fixed global parameter that is sufficiently small given e. We note in passing that 
this condition is essentially identical to the condition needed to prove concentration of subgraphs 



counts in Gn,p using Kim-Vu polynomial concentration [16]. (See Lemma 3.1 below.) We say that 
V is controllable if it is controllable at time 1. 

2.3 Tracking variables 

Recall that each variable V has a tracking variable TV and we track the difference W = V — TV. 
We do this to isolate variations in V from other variations in G{i). 

The tracking variables are defined as follows. For the global variables we take 

TQ = q, TR = n^ ■ {Q/n^f = Q^n~^ TS = r? ■ 2tn'^/^ ■ {Q/n'^f = Itn'^/^Q^. 
If y is a one-vertex extension with a edges and b open pairs we take 

TV = n- (2tn-i/2)a . (^Q/n^f_ 

That is, we set TX^v = Q'^n^^ and TY^v = 2tn-^/^Q and TXu = Qn"^ . 

For the stacking variable with |7r| > 2 we have two cases, depending on the form of vr. If 
-7r(|7r| — 1) 7^ O or 7r(|7r|) G {0,£'} we write vr = vr" • ?7, where U is the last element of vr. Then 

Now suppose vr(|vr| — 1) = O and vr(|vr|) ^ {0,E}. We say that the open pair a|7r|-2«|7r|-i aiid 
the pair Q!|^|_2a|7r| (which can be an edge or an open pair) are partner pairs. In this case we need to 
modify the tracking variable to take account of the rule concerning OY^ and OX^ in the definition 
of We write vr = tt~OU , where U is again the last element of vr, /3 = a|7r|_2 and 

'E/es- ■ Q^~' if ^ e {X',XO} 

rs:^ = EfJ^; ^/%) • 2in-V2 if ^ = 

where Yj, denotes the degree of the vertex b. 

The estimates that we prove for controllable variables are much weaker than the estimates for 
the variables discussed above. For a controllable variable V we simply take the tracking variable to 
be the scaling v. 
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2.4 Error functions, stopping times, estimates, conventions, notation 

We begin with the constant M. The value of M depends on the length of time for which we want 
to track the process. Throughout the paper we fix a global parameter e > 0, which can be taken 
arbitrarily small. We track the process until the time tmax at which q{tmax) = n~^^'^^^. Thus 

tmax = ^\/(l/2-£)logn. 

We set 

M = 3/e. 

Now we turn to the error bounds for variables in our three ensembles of variables. We define 
parameters 

e = q'^/^n"^^'^ and L = y^ogn. 
Our error functions take the form 

ev = fv + ^9V, 

where 

fy = cv(f>v and gv = cv'&L-^l + t-<^^)(j)v 

and 4>v is one of e = g~^/^n~^/^, or = q~^n~^^'^ or . We take (pv = ^ when V is a stacking 
variable, (f)v = when F is a global variable, and = when F is a controllable general extension. 

The function -d is introduced to deal with some technicalities that arise for small t. We let K be 
a constant such that K > and let ■d{t) be any increasing smooth function such that 'd{t) = e^* 
for < t < 1, < 2e^ for all t and supj>o is finite. Note that the power of t in gv is 

chosen so that the dominant term in vgy as t — t- does not contain a power of t. All of our variables 
except for vertex degrees have at least one open pair; thus vgy has a non-negative power of q, so is 
approximately non- increasing (i? makes it increase by a constant factor). 

The constants cy are all polylogarithmic. We specify them now in advance of the analysis, but 
we will keep the notation general so that it is clear how to choose the constants. For all controllable 
variables we take cy = 1. For the global variables we take 

^ _r40 ^ _or40 ^ _/|r40 
cr — 1j , cs — 2.L , cq — 4:L . 

The constants for the stacking variables need to be chosen very carefully. The idea is that the 
constants should decrease as the length of vr increases, and there is a more substantial decrease for 
each occurrence of O or E. For a stacking variable V = S^^ we set 

cy = = LiSg^^'-H-^^'iW. 

Now we define various stopping times associated with the process. Consider any variable V in 
our ensembles, and write V = X^^j^y for some extension {A, J, F). We say that V is had (at step i) if 
there is an edge e in G{i) such that (J, F) -|-^~^(e) contains a triangle or a path of length two joining 
the vertices of an open pair. If V is not bad we say that V is good. Note that if 4> is bad then y = 0, 
so certainly it is not following its expected trajectory. For example, if uv is an edge then Y^v is bad. 
We let Jy be the smallest i > iy such that V is bad, or oo if there is no such time. As indicated 
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earlier, we modify the definition above of the variable ZV{i) by defining ZV{i) = ZV{ Jy — 1) for 
i > Jy to ensure that it trivially satisfies the supermartingale condition. 

We only start tracking variables using differential equations at the point where we require a 
better bound than that given by coupling, in that the coupling bound does not rule out the variable 
entering its critical window. More precisely, we start tracking the variable V at the first value i = iy 
such that i > rc'l'^ and gy(t) < L^^. It follows that ey = o(l) for any t > ty = iyn^^/'^ for all 
variables in our ensembles. 

Now we let ly be the smallest i with iy < i < Jy such that > ey{t)v{t), or oo if there 

is no such time. Then we let /i, I2, I3 respectively be the minimum of Iy over all variables V in 
the global, controllable, stacking ensemble. Finally, we let / = min{/i, /2, /s}. Note that if / < imax 
then there is some V such that I = Iy = i* < imax-, i-e. W{i*) is too large and V is good at step i* , 
as required for the strategy described above. We emphasize that, since we can restrict our attention 
to i < /, we may assume that < ey{t)v{t) for all good variables V when verifying the trend 

and boundedness hypothesises. 

We prove the following theorem over the following four sections. In the next section we estimate 
errors in variables for small t by coupling to the usual random graph. We then apply the strategy 
stated above to the global, stacking and controllable ensembles, respectively, in Sections 4, 5 and 6. 

Theorem 2.2. With high probability I > imax- 



Note that Theorem |l.l| follows immediately from Theorem 2.2 We will employ the following useful 
lemma extensively to estimate sums of products. The proof given here is due to Patrick Bennett. 

Lemma 2.3. (Product Lemma) Suppose (xj)jg/ and {yi)i(zj are real numbers such that \xi — x\ < S 
and \yi — y\ < e for all i & I. Then we have 



iei ' ' Vie/ / Vie/ 



Vi 



< 2\I\5e 



Proof. The triangle inequality gives 



^{xi - x){yi - y) 



i&I 



< \I\5e. 



Rearranging this inequality gives 

Xiyi = x'^yi + y'^Xi- \I\xy ± \I\5e 



iei 



' ' Vie/ / Vie/ / V ^ iei / V ' ie/ / 



□ 



We conclude this Section with some notation and conventions that are used throughout the paper. 
We use compact notation for one-step differences, writing Aj(F) = F{i + 1) — F{i) for any sequence 
F{i) and Ai{f) = f{{t + l)n^/'^) — f{tn^/'^) for any function f{t). We also use the following notational 
conventions: 
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• We use the '0-tilde' notation / = 0{g) to mean |/| < (logn) for some absolute constant 
A. 

• We use the abbreviation 'whp' for 'with high probabihty'; all such statements will have sub- 
polynomial failure probability, which will justify us taking a polynomial number of them in 
union bounds. 

• We denote the vertex set by [n] = {1, . . . , n}. 

3 Coupling and union bounds 

In this section we gather two types of estimates that can be made without using dynamic concen- 
tration, namely coupling and union bounds. 

3.1 Extension variables in G{n,p) 

We start by considering extension variables in the Erdos-Renyi random graph G{n,p). Suppose J is 
a graph and A C Vj. We refer to {A, J) as an extension. Given an injective map (f) : A ^ [n], we 
let X^^j be the number of injective maps f ■ Vj ^ [n] such that / restricts to on A and /(e) is 
an edge of G{n,p) for every e £ J. Given A Q B Q B' Vj we define the scaling S^' = Sg {J) = 
nl-B'|-|s|/j[s']-<=j[s]. Note that if ^ C B C 5' C C we have Sf = S%' . We say that 
{A, J) is strictly balanced with respect to G{n,p) if S^'' < 1 for all ^ C 5 C Vj. The extension series 
for {A, J), denoted {Bq, . . . , Bd), is constructed by the following rule. We let Bq = A. For i > 0, if 
(Bi, J) is not strictly balanced then we choose -Bj+i to be a minimal set C with Bi C Vj that 
minimises S^,; otherwise we choose -Bi+i = Vj, set d = i + 1 and terminate the construction. 

To control extensions we quote the following result of Kim and Vu [16, Theorem 4.2.4] in a 
weakened form that suffices for our purposes. 

Lemma 3.1. Suppose {A, J) satisfies > n" for all A C B <^ Vj in G{n,p) for some absolute 
constant a > 0. Then there is an absolute constant /? > such that whp X^j = (1 it n~^)S^ for 
all injections (p : A ^ [n]. 

We also require a weaker estimate that also applies to sparse extensions, as given by the following 
union bound lemma. We include a brief proof as it illustrates a method we will also use for similar 
estimates in the triangle-free process. 

Lemma 3.2. If {A, J) is strictly balanced then whp X^^j < L^l^-'l maxIS"]^^, 1} for all injections 
(j): A^ [n]. 

Proof. First we note that for any fixed f ■ Vj ^ [n] restricting to 4> on A we have P(/ G X^^j) = 
p<^j-f^j[A\ _ Next we estimate the probability that there are s extensions in X^^j that are disjoint outside 
of (l){A). An upper bound is s\^'^{n'"-^~\^\y ■ {p'^J~^J[A\y ^ (2>s^^ S^Y , which is subpolynomial for 
s = L^maxjS^-^, 1}. Now we show by induction on \ Vj\ — \A\ that X^^j < L'^(\'^j\~\^\^ maxjS^^-', 1}. 
The base case |Vj| — |A| = 1 holds by the bound on disjoint extensions. By strict balance and 
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the induction hypothesis, at most 2I^'^IL'^(I^I I"^' < L'^^'^l '"^'^ ^ embeddings intersect any fixed 
embedding outside of (f){A). So X^j^r < L^d^l-I^D^^s < L^d^l-I^D max{5X, 1}. □ 
For general extensions we obtain the fohowing bound by applying the previous lemma to each 
step of the extension series, noting that S^''_^^ > 1 for i > 1. 

Lemma 3.3. For any extension {A, J) whp X^^j < L^'^-'l maxACBCVj for all injections (j) : A ^ 
[n]. 



3.2 Coupling estimates 

Our error functions are chosen to control the process for as long as possible, and the interesting range 
is t = O(logn). However, the powers of t cause the functions to be badly behaved for small t. So 
we need alternative estimates for t = o(l), which we obtain by coupling to the usual Erdos-Renyi 
random graph process. We start tracking a variable V using differential equations at the first time 
t > n~^/^ where we require a proportional error that is o(l) (we will be more precise below). Note 
that we have to exclude very small t to obtain concentration, and we will see from the calculations 
below that t = n~^/^ is a natural starting point. 

Let ER{n,j) denote the Erdos-Renyi random graph process with n vertices and j edges, where 
at each time step we choose the next edge to be a random pair that is not yet an edge. To obtain 
the triangle-free process we modify ER{n,j) by rejecting any pair that is closed, in that it forms 
a triangle with the edges already selected. After j steps the selected edges form the triangle-free 
process G{i) at i steps, where j — i edges were rejected. The number of rejected edges is bounded 
by the number of triangles in ER{n,j); call this T. We can approximate ER{n,j) by the binomial 
model where edges are chosen independently with probability pQ = Ijjr?. If po > n"^"*"" for some 



a > then by Lemma 3.1 we have T < 2p^v? = 0{j/n)^ whp. We consider j = Q{n^^^^^) with 
< c < 1/4 so that T = 0(n~^/^'*'^'^)j. Then i = {1 + 0(n~-'^/^+^'^))j, so we can approximate 
G{i) by G{n,po) with po = {I + 0{n-^/^+^''))p. Recall that we aim to prove V = v{l ± ey) where 
= fv + "^gv for all variables that we track. We do this by showing that the probability that 
T>V = V — TV ever crosses the critical interval 

iifv + gv)v, ifv + 2gv)v) 

is very small. In order to circumvent technical issues with powers of t when t is small we begin the 
critical interval analysis at the smallest index i such that i > n^^^ and gvit) ^ L^^- Recall that we 
denote this step by iy- Let ty = iv'n~^^'^ be the corresponding time. If the structure that is counted 
by V has no edges (i.e. features only open pairs) then we simply have iy = n^^^ and ty = n~^/^. 
The key point here is that the variable V must not be in (or beyond) the critical interval at the 
moment that we begin the critical interval analysis for that variable. We now use the coupling with 
the Erdos-Renyi process to establish this property for all variables that we track. 

Lemma 3.4. With high probability every tracked variable V satisfies \'DV{iy)\ < {fy + gy). 

Proof. We begin with an arbitrary variable V with the property that the structure counted by V 
has no edge. We show that V{iy) = V{n^^^) = v{l ± gy) with high probability. Let V = X^^j^r, 
using the notation for extension variables from the previous section. Note that we take A = for the 
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global variables Q, i^. We have u (n-^/^) = n"(^)g°(^) = n"(^)(l -0(n'V2)). Since = Lo{Ln-^/^), 
it suffices to show that X^j^r = ± 0(n-i/2)). Note that - F is bounded by 

times the number of paths of length 2 in ER(n, j); call this P2- Now, by Chernoff bounds in ER{n, j) 
whp every vertex has degree pon it (pon)^'^^, which we roughly estimate as (1 + O{n~^'^))pon when 
Po = Using this estimate on degrees we have P2 = (1 + o{l))n{pQn)'^ = (1 + o(l))4t^n^ = 

0(n^/^), as desired. 

It remains to consider variables V = X^^j^y where J 7^ 0. We consider S ffist, as this is the only 
such variable with gv{n~'^^^) < . (Thus this is the only remaining variable with iy = n^/^.) We 
can bound S above by 2jn, as any copy of the structure counted by S determines an ordered edge 
and a vertex. These structures are not counted if the non-edge pairs are either closed or edges. Thus, 
we bound S below by 2jn — 2P2 — 2P3, where P3 is the number of paths of length 3 in G{n,po). By 
the vertex degree estimate we have P3 = (1 + o{l))n{pon)^ , so S{n^^^) = 2n^/^ + 0(n^/^). This is 
well within the desired error bound. 

For the remaining variables V (i.e. stacking or controllable variables) we have gv{n~^^^) > 1. 
Fix such a variable V = X^^j^y- We bound V{iv) above by X^^j in G{n,pQ) and below by 



X. 



0,J 



E 



X. 



4^1 J xy 



in G{n,po), where for xy £ T \ J we define Jxy to be the graph obtained from J by adding a new 
vertex z adjacent to x and y, and for xy £ C2) \^ '^^ define Jxy to be the graph obtained from J 
by adding xy as an edge. 

Firs t su ppose that y is a stacking variable. Considering V one vertex at a time and applying 



3.1 



Vjx 



to each step we obtain X^j = v{l + 0{n ")) for some /? > 0. As 5^ is smaller than 



Lemma 

V hy a polynomial factor for any A Q B C Vj, we get the desired bound on Ylxy ■^'PJxy from an 



application of Lemma 3.3 



Now suppose y is a controllable variable. To estimate X^ j at time ty we first note that since 



L-^ = gvitv) 

we have 



V 



we 



have > L-^n-^l'^. Since ty < 1, for any A C 5 C Vv 



t 



(^J[B]-ej[A] qB 
V 



V 



n 



(1) 



So we can again apply Lemma 



3.1 



to conclude that X(f,j = v(l + 0{n ")) for some /? > 0. It also 



follows from ( 1 ) that for any A <^ B (^Vr the scaling is smaller than w by a polynomial factor 

and we again get the desired bound on Ylxy -^(pJxy 



from an application of Lemma 3.3 



□ 



3.3 Union bounds 



In this subsection we adapt the argument of Lemmas 3.2 and 3.3 to give a crude bound on general 
extension variables that holds throughout the triangle-free process. Along the way, we will prove 
Theorem 1.4, assuming Theorem 2.2 First we need some estimates for Q{i) and Yu^{i) for any pair 



uv that hold throughout the process. 

Lemma 3.5. On Gi, whp Q{i) = (1 ± n-'^/2)g(t) forO<t<tr, 
we have Yujj{i) = (1 it n^^/^)y{t) for n 



Also, whp for any non-edge uv 



-0.4 



<t<tr. 



14 



Proof. The bound for Q{i) holds on the good event Gi for t > n For t < n it holds because 
q{t) = (1 - 0(n-V2))n2, Q{i) > Q{n^/'^) = (1 - 0(n-i/2))„2 q^^ < g(Q) ^ ^2^ 

The bound for Yuv{i) follows from ey < n^^/^ for t > n~^''^^ and e small, so we can assume 
t < n^^'^^. Thus we are in the case n}'"^ < i < n^'^^. For j = 0(n"^'^^), by Lemma 



3.1 



we estimate 

the number of triangles in ER{n,j) by T = 0{j/n)^ = 0{n^^'^^)j. Thus we can couple G{i) to 
ER[n,j) where j = (1 + 0{n~^''^^))i. Writing d{v) for the degree of v in ER{n,j), by Chernoff 
bounds whp ci(f) = 2j/n + 0{j/nf'/^ = (1 it n~'^-^^)y. This gives the required upper bound on 
Yuv{i)- We also have Yuv{i) > d{v) — T(v) — P3{uv), where in ER{n,j) we write T{v) for the number 
of triangles containing v, and P3{uv) for the number of paths of length 3 from u to v. Each of T{v) 



and P3{uv) have scaling n^p^ < n~^/^ <C 1, so by Lemma 3.3 whp both are at most < n~'^'^^y, 
and we also have the required lower bound on Yuv{i)- D 
We need some further notation and terminology for general extensions in the triangle- free process, 
which mirrors that used previously for extensions in the Erdos-Renyi process. We say that {A, J, T) 
is strictly balanced at time t if < 1 for all ^ C B C 1/p. The extension series at time t for 
{A, J, r), denoted {Bq, . . . ,Bd), is constructed by the following rule. We let Bq = A. For i > 0, if 
{Bi, J, r) is not strictly balanced then we choose Bi^i to be a minimal set C with Bi C Vr that 
minimises 5^.; otherwise we choose -Bi+i = Vr, set d = i + 1 and terminate the construction. 



In Lemma 3.8 we will give a general estimate for extension variables in the triangle-free process. 



First we illustrate the argument in the following lemma, which shows that sparse graph pairs do 



not appear; this is the main tool needed for the proof of Theorem 1.4 Here we take A = 0, write 



Vj^r = Xfpj^Y^, where (p is the unique map from to [n], and vj^r = S^^{J,r). 

Lemma 3.6. Suppose vj,r(i') < for some c > and time t' . Then the probability that Qi holds 
for i < i' but Vj^r(^') > is at most 2n~'^. 

Proof. Suppose first that t' < L^^. Here we appeal to the coupling with the Erdos-Renyi random 
graph process. It suffices to estimate the probability that J appears in Gnj, where j = (1 + o(l))i' . 
The expected number of copies of J is at most 2n~^, so the required bound follows from Markov's 
inequality. Thus it suffices to consider t' > L^^. 

To estimate P(Vjj(i') > 0), we take a union bound of events, where we specify the injection 
f : Vr ^ [n], and for e G J we specify the selection step ie at which the process selects the edge /(e). 
Fix some choice and let £ be the specified event. For each i < i' we estimate the probability that the 
selected edge is compatible with S. At a selection step i = ie the selected edge is specified, so the 
probability is 2/Q{ig) = (1 + o(l))2g(te)~^, where te = n^^/^ig. For other z, the required probability 
is 1 — Ni/Q, where Ni is the number of ordered open pairs that cannot be selected at step i on £. If 
z is a selection step we write Ni = 0. Then we estimate 

i' 

< 11(1 + o{l))2q{ter' . J](l - N,/Q). 

eeJ i=l 

Now we estimate Ni when i is not a selection step. For i < L~^n^/^ we use the trivial estimate 
Ni > 0, so suppose i > L~^n^/^. Suppose there are ki choices of e G J with ie > i- Then there 
are |r \ J| + ki open pairs that must not become closed, namely the open pairs of /(F \ J) and 



15 



the ki pairs that have yet to be selected as edges. Note that the number of choices of Cj that close 
more than one such open pair is 0{L^) = o{y). By Lemma 3.5 all y-variables are (1 + o(l))y, so we 
obtain Ni = {I + o(l))(|r \J\ + h) ■ 4y. Thus for i > n^-^ we can write 1 - Ni/Q < {I - A., - Bi), 
where = (1 + o(l))|r \ J| • Stn"^/^ a,nd Bi = {I + o{l))ki ■ Stn"^/^ This holds for all i if we set 
Ai = Bi = for i < L-^n^l'^. 

Using the estimate ^(1 — Ai — Bi) < exp {—Yl^i~Yl ^i} '^^ see that the contribution to F{£) 
from the first factor is 

expJ-^^A =expi -(l + o(l)) |r \ J| • 8tn'3/2 

I 1=1 J [ j=L-in3/2 

' i' 

= exp^ -(l + o(l))|r\ J| -Sin-^ 

= (1 + o(l)) exp {-|r \ J| • ^(i fn~'^} 
= (l + o(l))e-4(*')'|rVI = (l + o(lM0l^\'^l 

since ^^i" ^ m^^ < = o(l). From the second factor we obtain 

exp|-^Sil =exp| -(l + o(l)) ki-Stn^^^"^ 

-(l+o(l))^ Y 8m- 
= ll{l + o{lMQ. 

eeJ 

Therefore 

IP(0 < (1 + o(l)) n 2?^"^ • Qit'f^-^^- 

eSJ 

Summing over at most n}^^^ choices for / and (z')'"'! choices for the selection steps, we estimate 
P(Vj,r(i') > 0) < (1 + o{l))v{t') < 2n-^. □ 
Proof of Theorem 1.4\ Statement (i) is immediate from [S'j Theorem 1.6(iii)]. For (ii), H' C H 
with d2{H') > 2. We can assume that the global parameter e satisfies \Ehi\{1/2 — e) > \Vh'\ + e. 
Note that if i/ C G then Vj^H'iimax) > for some spanning subgraph J of H' . We have 



Thus the result follows from Theorem 12.21 and Lemma 13.61 □ 
Lemma 3.7. // 5]^^ < y/L'^ for allACBOVr at some time t' then whp 



X<.jr(0 < i^'^^' max S^^ 



for all injections (j) : A ^ [n]. 
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Proof. It suffices to consider t' > L as for smaller t' we can simply appeal to the coupling with 



the Erdos-Renyi random graph process and apply Lemma 3.3 



The argument is similar to that of Lemma 3.6 We write s = max^cscVr ^'^d estimate 



the probability that there is a set of s extensions {/i, . . . , fs} in V{i') := X^pj^rii') that are disjoint 
outside of </>(A). We take a union bound of events in which we specify fi,...,fs, and for each 
1 < i < -s and e £ J \ J[A] we specify the selection step ij^e at which the process selects the edge 
fj{e). Fix some choice and let £ be the specified event. For each i < i' we estimate the probability 
that the selected edge is compatible with S. At a selection step i = ij^e the selected edge is specified, 
so the probability is 2/Q{ij^e) = (1 + o{l))2q{tj^e)~^ , where tj^e = n~^/'^ij^e- For other i, the required 
probability \s 1 — Ni/Q, where Ni is the number of ordered open pairs that cannot be selected at 
step i on 8. If i is a selection step we write Ni = {). Then we estimate 

s i' 

m<Yl n {i + o{i))2q{tj,e)-'-ll{i-m/Q). 

i=i ee J\J[A] i=l 

Now we estimate Ni when i is not a selection step. For i < L^^/^n^/^ we use the trivial estimate 
Ni > 0, so suppose i > L^^^'^rt'/'^ . Suppose there are ki choices of (j, e) with ij^f. > i. Then there 
are o{V)s + ki open pairs that must not become closed, namely the o{V)s open pairs specified by 
fi, ■ ■ ■ , fs and the ki pairs that have yet to be selected as edges. Note that the number of choices 
of Si that close more than one such open pair is O(s^L^) = o{syL~^), since s < y{t')L~'^ < yL~^'^ . 



By Lemma 3.5 all ^-variables are (1 + o{l))y, so we obtain Ni = [1 + o{\)){o{V)s + ki) ■ Ay. Thus 
for i > L"V2„3/2 ^g^^ I _ j^.jQ < 1 _ ^. _ ^.^ where Ai = {I + o{l))o{V)s ■ Stn"^/^ and 

Bi = {l + o{l))ki • 8tn~3/2_ This holds for ah i if we set Ai = Bi = ior i < L^i/^n^/^. 



Similarly to the proof of Lemma 3.6, the contribution to P(£') from the first factor is 

i+o{imtr^''^Y , 




exp < 

and from the second factor is 11^=1 neeJ\J[A](l + ^(l))9(^j,e)- Therefore 



^2 



¥{£)<q{t')<''>l[[{l + o{l)) n 

i=i V eeJ\J[A] 

Summing over at most s!~"^n"^^^* choices for /i, . . . ,/s and (i'Y^^^^ choices for the selection steps, 
the probability that such /i,...,/<j exist is at most s!~^[(l + o{l))v{t')Y < {3s~^v{t'))^ , which is 
subpolynomial. 

In the case when (^4, J, F) is strictly balanced at time t' , the required bound on X^^j^p{i') follows 



by induction as in the proof of Lemma 3.2 The general case follows by applying this to each step of 



the extension series. □ 

Lemma 3.8. Whp for any extension (A, J, F) on at most vertices and all injections cp : A ^ [n] 
we have 

X^..,rm < L'l^^l max S^^ . 



acbcVt 



r 
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Proof. Let A = Bq, . . . , Bf = Vrhe the extension series at time t. For each step {Bi, J[Bi^i],T[Bi^i]) 
there are two cases. If S^'^^ (t) < then we can apply Lemma |3.7 to bound the number of ways 
to make the step. On the other hand, if S^'^^^{t) > then this extension variable is controllable at 
time t, and we bound the number of copies by Lemma 5.1 □ 



4 Global Ensemble 

In this section we prove that the global variables have the desired concentration, assuming that 
this is the case for all ensembles at earlier times. Recall that we track each variable V relative to 
a tracking random variable TV to isolate variations in V from variations in other variables that 
might have an impact on V . Recall that the global variables have scalings q = qv?, r = q^n^ and 
s = pq'^n^ = 2t(^rc'l'^ . We use the tracking variables 

TQ = q, TR = Q^n-'^, and TS = 2tn-^/^Q'^ . 

(Note that the tracking variable for Q is in fact a deterministic function.) We show that the difference 
random variables 

VV = V -TV 

for V G {Q, R, S} are all small throughout the process. Recall that Ii is the minimum of the stopping 
times ly over all variables V in the global ensemble, so the following theorem says that there is a 
very small probability that we reach the universal stopping time / before step imax 

because a global 

variable V fails to satisfy the required bounds \T>V\ < eyv. (Note that a global variable cannot be 
bad.) 

Theorem 4.1. P (/ = /i and h < imax) = o(l). 



We organise the proof of Theorem 4.1 into three subsections, in which we respectively verify the 



trend hypothesis, variation equations and boundedness hypothesis. 



4.1 Trend Hypothesis 

In this subsection, for each variable V in the Global Ensemble, we give an upper bound on the 
one-step expected change in the difference variable, namely 

E[AiVV I Ti] = E[VV{i + 1) - W(i) I Ti], 

under the assumption that V is in its upper critical window, i.e. 

{fv + 9v)v <VV < ifv + 2gv)v. 

We consider the effect of each open pair and edge in the structure counted by V separately; the final 
expression is then obtained by linearity of expectation. When an open pair in a copy of the structure 
counted by V is chosen or closed, we say that the copy is destroyed. We balance the change in V 
due to destructions with the change in TV due to the change in Q. (The case V = Q \s handled 
differently as Q is tracked relative to the deterministic function q.) Adding the edge e^+i can also 
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create new copies of the structure counted by V in which e^+i plays the role of one of the edges in 
the structure; then we say that a copy of V is created. The change in V that comes from creations 
is balanced with the change in t in TV. 

To illustrate this mechanism consider V = Yuv, which we do not treat in this section but is a 
good example here. We write Yuv = Yuv ~ where Yuy counts the number of creations and Yuy 
destructions up to time i. Recall that TYuv = 2tQn~^/^. We define Ai{TYuv) and Ai{TYuv) by 
writing 

AiiTYuv) = TYuv/itn""^) - ^^TVuv + 0{t + t'^)yn-^ 

= MTY^,) - MTYi) + 0{t + r^)yn-\ 

Now we can write 

Ai{VYuv) = MVY^,) - MVYi) + 0{t + t-^)yn-^ 

where Ai{VY^,) = A,(y„-J - A,(7%^) and AiiVY^) = A,{Yt) - A,{TYl) . In this way we balance 
any change in Yuv with a corresponding changes in TYuv 

In general, for a variable V with underlying graph pair (J, T), we use the notation 

^ = E^^- E ^^ 

eeJ eer\j 

where for edges e S J we let count the number of creations of V up to time i by the selection of e, 
and for open pairs e G T we let count the number of destructions of V up to time i by the closure 
or selection of e. (Actually, these are approximate expressions because a given copy of V may be 
counted more than once, but we will see that this gives a negligible error term.) We decompose the 
changes of the tracking variable as Aj(TV) = X^eeJ^^C^^^) ~ Z]eer\j ^iC^^*^)' where A.i{TV^) 
is the change in some term in TV that corresponds naturally to e (there is often an additional error 
term). We then use A.i[TV^) to balance Aj(y^) for each e. Thus we write 

A,(W) = J^Ai(W^)- Ai(W^), 
eeJ eer\j 

where Ai(W") = Ai{V^) - Ai{TV^). 

Now we proceed to compute the impact on K[Ai(T>V) \ Ti] for each open pair and edge in the 
structure counted by global variables V. We begin with destructions. The main point to note in 
these calculations is that each open pair gives a self-correction term of —8tfyvn~^^'^, which will later 
balance a corresponding Stfyvn"'^/'^ term from the change in eyv. 

4.1.1 Simple destructions 

The variable Q has the property that there is a larger extension that describes the situation when 
an open pair ah in Q is closed. We call destructions of this form simple destructions. (We will see 
examples of this type again in Section [6] where we treat the stacking variables.) Note that each triple 
in 5 describes a way in which the addition of an edge closes a pair, so we have 

KlAiQ"^ \Fi]=2 + ^S/Q. 
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Then 



E[Ai(PQ) I F^] = E[A,(Q) - Ai(g) | Fi] 

= -(2 + AS/Q) + 8tqn-^/^ ± d{qn-^) 

= -8tQn~^/^ ± (8 + 0{eQ))estn^^^q + Stqn'^''^ ± 0(1) 

< -Uq + 5q - (1 + o(l))es)8tgn-3/2. 

In the last hne we used VQ = Q — q > {fq + gQ)q when Q is in its upper critical window and 

tesqn~^/'^ > cse'^qn~^/'^ = C5 » 1. 

4.1.2 Product destructions 

For destructions, it remains to consider those of R and S. These are not simple destructions, as no 
variable in the ensemble counts ways in which these structures are destroyed by the closure of open 
pairs. Instead we will apply the Product Lemma from Section [2j Let y be i? or 5" and let ab be 
an open pair in V. For each open pair a/3 in G{i) let V^^ be the number of copies of the structure 
counted by V that use a as a and /3 as b. So Ra/3 = Xap and Sap = Y^p- We note that 

X] ^ap = S and ^ Xap = R. 

aP&Q aP&Q 



So by the Product Lemma we can write 
E[Ai(W^) \Fi]=E 



Mv") - ^^rv I F 



= 2Q-HYap + YpaWap - (2 + 4SQ-^)Q-^TV 

= 4SQ-^V ± O {eyyev^^vap) - ASQ-^TV ± 0{v/q) 

= (1 ± (1 + o(l))e5)8tn-3/2py _t 0(eyev„^)tvn-3/2 ± 0{v/q) 

> [(1 + o{l)){fv + gv) - 0(eyey„,) - ©(r^e^)] Stvn-^l\ 

This calculation holds for every open pair in the structures counted by R and 5, but we also need a 
'destruction fidelity' term to correct for simultaneously closing the two open pairs in a single copy of 
the structure counted by V . Note that this cannot occur for 5, as the configuration for closing the 
two open pairs in S would make a triangle with the edge in S. For R we need to estimate the number 
of i^4's in which two adjacent pairs are edges and the other four pairs are open. By Lemma [3. 8| there 
are at most L^^n^p^q^ = 0{L}^t^qr) such configurations, so the correction to E[Aj(Pi?'^) | Fi] is 
0{q^^ ■ t^L^^qr) = 0{L}^t^rn~'^) = o{gR)trn~^/'^ , assuming (say) 



CR>L 
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4.1.3 Creations 

Finally, we turn to creations, which among the global variables occur only for S. Let V = S and let 
ab be the edge in V. Recall that V'^{i) denotes the number of creations of V through i steps of the 
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process (which can only occur through the addition of an edge that plays the role of ab) . Let be 
the structure given by replacing the edge ab with an open pair, so = R. Note that 

For each extension in V~^{i) the probability that the edge Cj+i falls in the appropriate position to 
result in a creation counted in Aj(y^) is 2/Q{i), so we can write E[A.j(y^) | Ti] = 2Q^^V^. There 
is no 'creation fidelity' term, as it is not possible for the added edge to close an open pair in V^{i). 
We match this change in V with the change in TV that results from the one-step change in t, which 
is Ai(rF^) = TV/{tn^/^). We have 

= 2Q-\TV+ ± v+ev+) - r^n-^l'^rV 
= ±{l + o{l))t-^ey+vn-^''^ . 

4.2 Variation Equations 

Gathering together the relevant creation and destruction calculations from the previous subsection, 
under the assumption that the variable considered is in its upper critical window, we have 

E[AiPQ|JP;] < -(/Q + 5Q-(l + o(l))e5))8^gn-3/^ 

E[AiPi? I Ti] < -(1 + o(l)) [3(/ij + qr) - Oieyex) - Oir^e^)] Strn-^/^ 

n^iVS\Fi\ < -(l + o(l)) [2(/5 + 55)-g|-0(eyey)-0(t-V)]8tsn-3/2. 

For each variable V in the Global Ensemble we consider the sequence of random variables 

ZV{i) = VV- vey. 

Under the assumption that V is in its upper critical window, we show that this sequence is a 
supermartingale, i.e. that 

K[AiZV I 7i\ = E[Ai(PF - vev) \ H < 0. 

Note that 



Ai{evv) = ev{t + n-^/^)v{t + n'^/^) - evit)v{t) 

= {eyv'/v + ey)fn"^/^ + 0{ey + e'yv'/v + eyv" /v)vn 
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We estimate the last term by 0{t^ + 1 '^)evvn ^ = 0{evvn ^/^), using the fact that we restrict our 
attention to t > n~^/^. We write 

v'/v = t-^e{V) - 8to{V), 

where V has e{V) edges and o{V) open pairs not in A. Recalling that ey = fv + 2(/y, we see that 
we can cancel the 8to{V)fvvn~^/'^ term that occurs both in Ai{evv) and in E[AjI?F | J^i]: this is 
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the self-correction that is fundamental to the analysis. Thus we obtain 



8t 



+ o(/q) - (1 + o(l))(5Q + es) Stqn-^/'', 



ElAiZRlJ'i] < -(^^ + o{fii)-{l + o{l))i3gR + OieYex))-Oit-^e^)^8trn-^/^ 

E[AiZS < - (^^ + g + oifs) - (1 + oil)){2gs + || + O(eyey)) - Oit'^e^)^ Stsn'^/^. 

Wc now show that the error functions that wc have chosen grow quickly enough for each of these 
sequences to be supermartingales (i.e. the e'y term will be dominant in each case). We stress that 
the t <C 1 regime behaves a bit differently from the rest of the process in the estimates that follow. 
Now recall that our error functions have the form 

fv = cye^ and gv = cv^L'^il + t-<^^)e'^. 

Recall further that e = q~^/^n~^^^ and note that the power of t in gv is chosen so that the dominant 
term in vgv as t ^ does not contain a power of t. Since e'/e = 4t we have 

fi^/fv = 8t and g'y/gv = ^'/^-e{V)t-\l + t<^^)-^+8t. 

Therefore, we have 

e'v > 8tfv + {'&'/^ - e{V)t-^ + 8t) ■ 2gv. 

Wc apply this observation to each global variable in turn. 
For Q we have 



8tqn~^/'^ 



n^iZQ\Ti] < -(l + o{l))[{fQ + i^ + 2)gQ)-igQ + es 

< -a + oii))[{fQ-fs) + ii^gQ+gQ-2gs) 

Then the sequence ZQ forms a supermartingale provided 

CQ > 2cs. 

(Note that ^gq dominates the remaining terms for t ^ 1.) 
Next consider R, where we have 

K[AiZR I Ti] < -(1 + o(l)) Ifn + {^ + 2)-gR- 3gR - O(eyex) - Oit'^e'')] 8trn-^/^ 



< -(1 + 0(1)) [fR + (1^ - l)gR - Oifyfx) - Oigygx) 
Then the sequence ZR forms a supermartingale provided 

CR > Lcycx- 



For this implies that the gR'd'/{4t'd) term dominates for t ^ 1 and that the /r term dominates 
otherwise. 
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The final global variable is S, where we have 

E[AiZS\T,] < -{l + o{l))[{l + ^)fs + {^ + 2)-gs\8tsn-^/^ 

+(1 + o(l)) [(2^5 + + O(eyey)) + 0{t-'e^)] Usn-^'^ 
< -(1 + o(l)) [fs + i^ + ^''^sM-^- - 0{fy) - 0{gl) + o{gs)] Stsn-^^ 

Then the sequence ZS forms a supermartingale provided 

cs > 2cij and cs > Lcy- 

For this implies that the fs/t^ term dominates for t ^ 1 and the fs term dominates otherwise. 

4.3 Boundedness hypothesis 

To apply Freedman's inequality, for each V in the Global Ensemble we estimate Vary = Var{ZV{i) \ 
Fi-i) and Ny = \ AiZV\] recall that it suffices to show 

^ {awY gyv 
Vary < and Ny < —r-. 

For convenience we replace ZV by DV in our calculations, as this does not change Vary and only 
changes Ny up to a constant factor. We can estimate the contribution to the variance from each pair 
in V separately, using the simple observation that if random variables A and B each have variance 
at most V then A + B has variance at most 4f . 

Here we estimate Vary < Ny, i.e. for the global variables we simply apply the Hoeffding- 
Azuma inequality. For Q we have gqq > cgL'^n^^"^ , so it suffices to show Varq < CqL^^n^^'^ and 

^ cqL~'^v?/'^ . The change in VQ when the process chooses the edge Cj+i = uv is 

/\iVQ'^ = 2{Yuv + Y^u + 1) - {q{i + 1) - qii)) = Oiyey) = OicyLn^''^). 

Then Nq = 0(?7-^/^), and the required bounds hold easily. 

For R we have gar > CRL'^qpn^/"^ , so it suffices to show Varji < cj^L~^q'^n'^/'^ and Nji < 
CRL~'^(f'n^/'^ . On choosing Cj+i = uv, for each open pair in R we have 

^iVR'' = ^iR''-^^TR = 2 Yl iXab-Q^n~^) = diq'^/^n'/'). 

ab£YuvUYyuU{uv} 

There is also a destruction fidelity correction for closing two open pairs in a single triple counted by 
R, which occurs for open triples uab such that va and vb are edges, and similarly interchanging u 



and V. By Lemma 3.8 the number of such choices for at is O^nq"^), which is negligible in comparison 
with Ai'DR'^. The required bounds hold easily. 

For S we have gss > csL~^qv? , so it suffices to show Vars < c^L~^q'^n^^'^ and Ns < csL^^qn^ ■ 
On choosing ej+i = uv, for both open pairs in S we have 

A,VS' = - = 2 J2 (^«^ - 2iQn-3/2) = 0{y • eyy) = d{f/'n'/'). 

abeYuvUYvuUiuv} 
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Also, for the edge in S we have creation 

A,VS^ = AiS^ - TS/{tn^/^) = 2Xab - Q^n-^ = 0{exx) = d{q^l'^n^'^). 
The required bounds hold easily. 



Having verified the trend and boundedness hypotheses, the proof of Theorem 4.1 is complete 



5 The Controllable Ensemble 

In this subsection we formulate a general condition under which we can control the variable V = 
J r with accuracy . Let t' > \. Recall that V is controllable at time t' \i J ^ T and for any 
1 < t < we have 

Sa{J, r) > for all ^ C 5 C Vv. (2) 

Note in particular that these bounds are required to hold for v = {J, F). Note also that if V is 
controllable and is obtained from V by changing an edge to an open pair then is controllable. 

Recall that I2 is the minimum of the stopping times ly over all variables V in the controllable 
ensemble. The following theorem says that there is a very small probability that we reach the 
universal stopping time / before step imax because a good controllable variable V fails to satisfy the 
required bound \W\ < eyv. 

Lemma 5.1. P (/ = I2 and I2 < imax) = o(l). 

It is convenient to consider the following modified extension variable. Given an injective map 
f : Vr ^ [n], we say that a pair ab in /(Vr) is f-open if there is no vertex c such that ac, be are edges 
and c ^ f{Vr); note that it is the last condition that distinguishes the definition from that of 'open'. 
Let V* = X'^ jr(^) be defined in the same way as X(j)^jj-{i), except that pairs that are required to 
be open in X^^j^^ii) are only required to be /-open in X"^ jr(^)- We will show that V = V* ±gvv/2 
and V* = {1 ± ev*)v, where ey = ey — 5^/2. 

First consider the approximation of V hy V* . Fix e G (^^) \ F with e not contained in A. Let 
J'^ = J U {e} and F^ = F U {e}. We can estimate |F — y*| by the sum over all such e of Xtpje j^e. 



By Lemma 3.8 we have X<^,je,r= < L'^\^^\S^^{J'',r^)/S^{J'',r^), where B is chosen to minimise 
5f (J%F^). Note that 5^^(J^F'^) = pv and 5f (J%F'^) > pn^ by controllability. Thus X<^,je,r= < 
2^4|Vr|^j^-'5^ which is at most gvv/2 for n sufficiently large. Here we recall that ey = /v + 2gy has 



the form described in Subsection 2.4 namely 



fv = e^ and gy = t^L'^I + t-'^(^))e^ 

and note that e^ > n~^^^. 

To estimate V* we will apply Preedman's inequality to ZV* = V* — TV* — {fv + 3gv /2)v, where 
TV* = n"(^)p^(^)(Qn-2)o(V') (under the assumption that VV* = V* - TV* is in the upper critical 
window). We begin with the boundedness hypothesis because it requires a delicate argument that 
uses ([2]) in a crucial way. 
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5.1 Boundedness hypothesis 



Given gy as defined above, it suffices to show 

^"'''^ ^ ^5^3/2 ^'^^ < • (3) 

Note that the one-step change in TV* + {gy + fv)v is 0((i + t'''^)vn^^^'^) = 0(fn~^/'*), which is 
neghgible in comparison with the required estimates. Therefore, it suffices to consider changes in V* 
rather than ZV* . 

First consider the one-step change due to closing some e = a(3 G F \ J. Assume for the moment 
that \Vt\ < M^. (The case l^rl = is treated below). Let (J',F') be obtained from (J,F) by 
'gluing a y-variable on a/3' as follows. Let 7 be a new vertex, y = Vp U {7}, J' = J U {13^} and 
F' = F U {q;7, /37}. Note that this definition depends on the order of a and /3. We analyse closures 
of e by adding the edge corresponding to 07. Let A' = A^J {a, 7} and = niin^/cscy 5'a ) where 
all scalings are with respect to (J',F'). We can bound the probability pe that ^0,j,r is affected 



by closing an edge corresponding to e hy pe < L^I^ 'Sm/g, using Lemma 3.8 and the assumption 
< M^, as this event is contained in the event that the new edge is contained in an extension to 
the graph induced by the set 5m that defines 5m- Furthermore, we can bound the magnitude of the 



change in V* by Ne < L^l^'l • L'^^^'^S^' /S^, applying Lemma 3.8 twice. Then the one-step variance 
in V* satisfies 

(Note that we use S^' = yS\ = yv.) Now recall that 5m = and write 5^- = ^^-^^Sj"^^^. We 

have S^-^^ > y by construction of (J',F') and 5^™^''^ > since V is controllable. Thus 5m > n^y, 
and we have 

Vare = Oin-^v'^n-^''^) and = d{n-^v), 

which gives the required bounds provided that n is sufficiently large. (Note that > n~^^^.) 

We now turn to the one-step change due to destruction in the case \Vr\ = . In this case 
we cannot necessarily apply Lemma 3.8 as the extension we get by 'gluing a y-variable on a/3' has 

+ 1 vertices. Here we modify the argument slightly, getting the same bounds as above. First 
note that the bound on pe can be used exactly as above. If Bya has + 1 vertices then first take 
the extension to \ {7} and then extend to 7; this gives the same estimate as above because V is 
controllable. The bound on N^. is slightly more delicate. Consider the last step in the extension series 
for the extension from to V in the argument above. Let this extension be (C, J',F'). If /3 G C 
then the vertex 7 can be dropped from C and Vy' without influencing the extension. This gives an 
extension with at most vertices and Lemma 3.8 applies to give the same bound as above. So we 
can restrict our attention to the case /3 C. Note the extension from C to CU{/3} contains the e dge 



3.7 



/37 and the open pair a/3. So 5^' < S(^^ < y- Now, if 5^' < y/L'^ then we can apply Lemma 
directly to get the desired bound. On the other hand, if 5^' > yjV then S^^^^ > y/L^, and in this 
case we can bound the number of extensions from C to V' by first bounding the number of extension 
from C to C U 1/3} by Yq,^ and then bounding the number of extensions from C U {/?} to V by an 
application of Lemma 3.8 (using the fact that 7 would play no role in such an extension). In every 
case we sacrifice at most a polylogarithmic factor in the bound on A'^e, and the estimates above hold. 
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Now consider the one-step change due to creating a copy of this extension. Suppose that e G J. 
Let e = a/3 and A' = AU {a,P}. Define = min^/cBcVr where all scalings are with respect 
to (J \ e, r). Similarly to the case e E F \ J, we can estimate the probability pe that V is affected 
by choosing a pair corresponding to e hj pe < L^^^^^Sm/q, and the magnitude A^'e of the effect by 
Ne < L^l^rl . L^\Vr\s^^/S^. Here we have 

Var, < p^Nl < L''^\^\SJq){S'^/Sr^f = L^'\''\q/pfv' / {qS^). 

Since {J,T) is controllable we have 5m > n^iq/p), and we have 

Vare < L^°^^'^n-^v^/{2tn^/^) and Ne = d{n-^v), 

which gives the required bound ^ for n sufficiently large, since we must have e{V) > 1 in this case. 

We have verified the boundedness hypothesis, so Lemma 5.1 will follow from Freedman's inequal- 
ity, after we establish the trend hypothesis in the following subsection. 



5.2 Trend hypothesis 

Assume V = ^0,j,r is controllable and V* = jp is in the upper critical window: (/y + gv)v < 
VV* < {fv + 3gv/2)v. We give an upper bound on E[AiDy* | Ti], i.e. the one-step expected 
change in the difference variable. To organise the computations we gather terms according to each 
edge e of F. Those with e G J contribute positively (creation); those with e G F \ J contribute 
negatively (destruction). We write Ai{V*) = V*{i + 1) - V*{i) = EeeJ^*(^'') " Eeer\j ^il^'') 
and Ai^V) = A*{V'') ± Fj(y^), where A*(y^) is the main term and Fi(F^) is the 'fidelity' error 
term. For e G J we take Ai{V'^) = A*{V^) equal to the number of / G X^j^^^-^{i) such that /(e) is 
added at step i + 1. We can take the 'creation fidelity' term as Fi{V^) = because we are tracking 
V* rather than V. For e G F \ J we take A*(y^) equal to the number of / G jr(^) such that /(e) 
is selected as e^+i or closed at step i + 1. The 'destruction fidelity' term Fi{V'^) bounds the number 
of such / such that /(ej+i) also closes /(e') for some e' G F \ J with e' ^ e. 

Recah that we write Ai{TV) = TV{i + 1) - TV{i) = Y.e& ^i{TV^) + 0{t^ + t-'^)n-^v, where 
Ai{TV^) is ^ for e G J or ^^TV for e G F \ J. We also write 

Ai{VV) = VV{i + 1) - VV{i) = ±A,{VV^) + d{n-^/^v), 

eer 

where Ai(py^) = Ai{V^) - Ai{TV^) ± Fi{V^). 



5.2.1 Creation 

Fix e G J (if J / 0) and write = X^,j\e,r- Then 



E[Ai(W^) I Ti] = n^i{V') - A,{TV') I Ti] 
= 2Q-^V-^ -TV/{tn^''^) 
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5.2.2 Destruction 

Fix e G r \ J. We iiave E[Aj(l^^) | Fi] = 2Q-^ E/6y(l + <^/(e) ± L'^)^ where bounds tlie number 
of open pairs tiiat close /(e) and also f{e!) for some other e' G F \ J (i.e. this is the destruction 
fidelity term). Then 

'^i{Q)TV 



E 



Q 



= (1 ± ey)8tra-^/V ± 0{L^q-\) - (1 ± 0{es))8tn-^^^rV 
= Stn-^/'^VV ± Stn-^/^eyV ± 0{es8tn-^/^v) ± 0{L*q-^v) 
> [fv + 5y - (1 + o(l))ey - 0{es) - O(LV^e^)] 8tvn-^/^. 

Note that /y, = o(/y) and gs, LH~^e^ = o{gv)- So we have the bound 

¥.[Ai{VV^) I Fi\ > (1 + o(l))(/y + 5y - 23^)8^^-=^/^ 

5.3 Veiriation Equation 

As for the Global Ensemble, 

Ai{ev*v) = ey. (t + n-'^l'^)v{t + n"^/^) - ey. (t)t;(t) 

= {ev*v' /v + ey)un"^/^ + 0{ey + e'yi;'/^ + ey.i;'7^')^"~^> 

where we estimate the last term by 0{t^ + t'~'^)ev*vn~^ = 0{ev*vn''^^^). Note that 

e'v* = 4Stfv + (t?7i? - e{V)t-\l + f^^Y'^ + ASt) ■ 3gv/2. 

Since cy = 1 for all V in the Controllable Ensemble, we have ey+. — ey. = {3/2){gy+ — gv)- Note 
further that gv+ < 2tgv (and there is no term if e{V) = 0). Applying these observations we 
have 

E[Ai{ZV) I Fi] 



< (1 + o{l)r-^ev+* - (1 + o(l))o(y)(/y + 5V - 25y) 

- oiV))ev* + + 0(ey*r^n-i) + d{t-\-^) 
<'^-U9v^-9v) + "-^9v-Wv 



8t^ 
-( 



8ti? 



e(y) 



+ 



39k 



+ Oigv) + o{ev+ /t^) + oiev) 



< f (o(y) - f + Mp _ 3^) _ 15/^ + o(5y) + o{ey^/t') + o(ey). 
Then E[Ai(Zl/) | F] < 0, as —^gv dominates for t <C 1, and — gJ/y dominates otherwise. 



6 Stacking ensemble 

In this section we establish control of all variables in the stacking ensemble. Recall that I3 is the 
minimum of the stopping times 7y over all variables V in the stacking ensemble. The following 
theorem says that there is a very small probability that we reach the universal stopping time / 
before step imax because a good stacking variable V fails to satisfy the required bound \T>V\ < eyv. 
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Theorem 6.1. P(/ = /a and I3 < imax) = o(l). 

Before proceeding with the trend and boundedness hypotheses and variation equations, we discuss 
the extensions that require appeals to the controllable ensemble. 

6.1 Subextensions of stacking variables 

This subsection concerns certain subextensions of stacking variables that will be needed in the trend 
hypothesis. In two very important cases we will appeal to the Controllable Ensemble for our es- 
timates, and so wc need to show that these extensions are indeed controllable. The two special 
structures that we need to consider are the following. 

• An h-fan at the triple A = abc, also called a ^i^-extension at is any extension of the form 
(A, J, r), where the base A = abc, Vy has h additional vertices fi, . . . , w/j, hvi . . . v^c is a path 
of length h + 1 in F, and avi G F \ J is open for i G [h]. We emphasize that the pairs in the 
path bvi . . . VhC can be either edges or open pairs. 

• Let {uv,J,r) be the extension corresponding to some stacking variable vr G Sm such that 
w{Tr) = 2M. Note that if axCty is the last rung of tt then the extensions we get by stacking 
another copy of Y on axay are not in Sm- The backward extension B-^ is the extension 
{A' , J', r') with A' = {au, ay, ax, ay}, J' = J and F' = F \ Uxay. 

We now show that if we set 

h = M = Z/e 

then both of these extension variables are controllable at the time tmax when q{tmax) = n""^/^"*"^. 

We begin with the /i-fan. The minimum scaling of a $/i-extension is {qn)^p^^^ > rf^~^/'^{2traax)^^^ , 
which is achieved when the path belongs entirely to J. Fix B with A C. B C. V that minimises 

= S^{J, F). We need to show that > . We can assume that B 7^ Vr- Then we can find Vi 
in B such that not both Vi-i and Uj+i are in B. (Here vq = c and f^+i = b.) But removing Vi from 
B reduces the scaling by at least y > . Thus the minimum scaling is achieved by a single vertex 
extension of A, which is at least y > . 

Now consider a backward extension B^^ with w^n) = 2M, and fix B with A' C. B C. V. We 
calculate S^, going one vertex at a time. If B does not contain some vertex ai such that 7r(z-|- 1) = O 
or does not contain at least one vertex from each rung then by counting from the ends we see that 
S^/ > (n^)!-^!"!"^ L On the other hand, if B indeed contains every ai such that 7r(i + 1) = O and 
intersects every rung, then we claim that \B\ > M + 2. To see this, first note that the number of 
vertices in B in the interior of each triangular ladder must be at least half the number of turning 
points in the ladder rounded down. If vr has i occurrences of the symbol O then there are at most 
i + 1 triangular ladders and tt has 2M — 2 — i turning points that are on the interior of ladders. It 
follows that 

2M-2-i i + l „ 

\B\A'\>i+ > M - 2. 

I \ I - 2 2 - 

With this observation in hand, simply counting in the stacking order we have 
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6.2 Trend hypothesis 

Fix a stacking variable V = S^^. Assume V = ^0.,/,r is in the upper critical window: (/y + gv)v < 
DV < (fv + 2gv)v. We give an upper bound on E[AjDy | J^i], i.e. the one-step expected change in 
the difference variable. To organise the computations we gather terms according to each edge e of 
the underlying graph pair (J, T) of tt. Edges of vr contribute positively (creation); open pairs of tt 
contribute negatively (destruction). We write 

Ai{V) = V{i + 1) - y(i) = ^ \{V^) - ^ AiiV% 

eeJ eer\j 

where each Ai(F^) includes a 'fidelity' error term that accounts for the selection of edges that impact 
a given extension in more than one way. 

Consider the case that |7r| > 1, and vrdvrl — 1) ^ O or vrdyrj) € {0,E}. We write vr = tt~U, 
V~ = S^^ and recall that TV = V^TU. We say that a pair e is terminal if it belongs to U, 
i.e. it contains the final vertex of V; otherwise we say that e is internal. For ease of notation, if 
|7r| = 1 then we write U = V, and let V~ be the variable that is identically equal to 1. We write 
AiiTV) = Ai{V-)TU+V-Ai{TU) = EeeJ ^i('^')-Ee6r\j where Ai(TV^) is defined 

as follows. 

(i) If e is a terminal edge then Ai(TV^) = ^j2, 

(ii) If e is a terminal open pair then Aj(TF^) = ^^^^ TV, 

(iii) If e is internal then Ai{TV^) = Ai{V-^)TU. 

For each of these terms we will also include a 'fidelity' error term: for terminal pairs the fidelity term 
is d{vn-^/'^)). 

If 7r(|7r| — 1) = O and 7r(|7r|) ^ {0,E}, we instead write tt = Tr~ OU, = S^^ , (3 = Q:|^|_2 and 
recall that 

^T^fev-^m-Q^-' ifUe{X',XO} 

In this case we say that only Q;|7r|-iQ!|7r| is terminal; its treatment is exactly as (i) and (ii) above. 
However, recall that a|7r|-2Q^|7r|-i ^-nd a|7r|-2CK|7r| partner pairs. We write jd = a:|^|_2 and extend 
the above definition of Ai{TV^) as follows. 

(iv) If e is a partner edge then Aj(TF^) = E/eF- ' ' Qi^"^^ 

(v) If e is a partner open pair then 

MTV') = I E/ey- MXf^p)) ■ • 2tn-V2 iiU = Y' 
Xfev- MXf^p)) ■ Ym ■ Qn-' H U = Y^. 
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We emphasise that we do not consider partner pairs to be terminal, even though one of them uses 

the last vertex of V. 

Recall that we view S^^ as counting the number of injections : a^, ai, . . . , q;|^|} — )■ [n] such 
that ip{au) = u, ipiav) = v and ip{ai) is a vertex that plays the role in the extension defined by 7r(i) 
for i = 1, . . . , |7r|. We will also use the following notation: 

• For any y < \it\ wc let vrjj, denote the prefix of tt of length y. 

• If 7r(|7r|) G {X^ , X'^, , Y'^} we let tt" be obtained from tt by interchanging I and O in 7r(|7r|). 
In the following subsections we use the following classification for a terminal open pair e. 

(a) If e is a rung and ttY^ and ttY*^ both belong to Sm we say that e is simple. If e is a stringer 
and 7r°y^ and 7r°Y^ both belong to Sm we say that e is simple. 

(b) If w{tt) = 2M and e is the terminal rung we say that e is outer. If w{7r) = 2M — 1, 7r(|7r|) = 
and e is the terminal stringer then we say that e is outer. 

(c) If e is not simple or outer we say that e is a fan end pair. 

We note that outer pairs are not simple, as adding Y'^ to any tt' with w{tt') = 2M gives a variable 
not in Sm- we apply this with n' = it when e is the terminal rung or tt' = tt" when e is the terminal 
stringer. We also note that a fan end pair e is aptly named, as in this case tt must end with an 
(M - l)-fan. 

We now calculate the contribution from each type of edge and open pair. The key point here 
is that every open pair yields a self-correcting term of the form {fy + gy)8tvn~^/'^ . Furthermore, 
the only open pairs that give significant error terms are terminal open pairs. Thus, for any stacking 
variable there are only a small number of terms that need to be handled with care when we consider 
the variation equations. 

6.2.1 Internal non-partner destruction 

Suppose that e = axCty with x < y < |7r| is internal but not partner. Set tt' = Tr\y and let W = S^^,, 
= S^^' and = S^!J^°. Note that W is in Sm, but and may or may not be in Sm- 
In any case we have 

+ W^o = ^ ^/(..) + E ^fiy-) = (1 ± 
few few 

(The attentive reader might note that the first equality above is not technically correct: we should 
have replaced y/-(a:i/) by |y/(a;i/) \ However, this gives an error term that is miniscule by 

comparison with the others, so we ignore it here, and in similar situations throughout this section.) 

For each f £ W let Ff^^^ = Xf^ji^Y' count the number of forward extensions from / to copies of 
(J, F), in which A = {a^, cxvi ■ ■ ■ , (^y}, J' = J \ J[A] and F' = F \ F[^]. Note that this is simply 
another variable in this ensemble, namely Ff^^^ = Sj'^^^y where vr' o tt" = vr. We define F^^^- similarly 
(recall that tt = ■k~U). 



30 



Since Aj(7~T^'^) = Ai{V ^)TU, applying Lemma 2.3 we have 
= E[AiiV')-Ai{TV')\J^i] 



'2V{WO + W^) 



[l±0{eYe')) 



QW 



[l±0{eYe')) 



2V-{WO + W^) 
QW 



TU± 



> (1 + o{l)){fv +gv- Oieye^) - 0{r^L'^e^))Stvn-^/'^ . 

(Note that using is an overestimate for the error in the forward extensions, but we wiU incur such 
an error elsewhere, so we use the same formula here and throughout for simplicity.) 



6.2.2 Partner destruction 

Here we consider a partner open pair e = a^jOy with x < y. In this case vrdvrl — 1) = O, 
{0,E}, X = |7r| — 2 and y G {|7r| — 1, |7r|}. Note that if y = |7r| then it is isomorphic to 
y = |7r| — 1, so we can restrict our attention to the latter. We write vr = tt^OU, 



w = si,^,w'- 



and W = SZ, 



Note that 



vr(lvrl) i 
the case 

_ C7r~ 
^uv 1 



f&W f&W 



[l±eY)2Wy. 



For f £ V we set 



We have 



Uf 



if ?7 = 



= E[Ai{V') - AiiTV) I T^] 

= E E (^/K)^ + ^^/K) ± (^/K)- - Uf 

f&V- zdXf^^^) 

2{WO + W^){V -TV) . -V2^^r^frA,^ 

= — ± Oieye'^tvn •^''^) ± 0{L^v/q) 

= (1 + o(l))Py8tn-3/2 ± Oieyehvn-^/^) ± 0{L*v/q) 
> (1 + o(l))(/y + gv- O(eye^) - 0{LH-^e'^))8tvn-^/^ . 



6.2.3 Simple destruction 

Let e = UxCty be a simple rung, i.e. the last rung of a ladder defined by vr with the property that 
ttY^ and ttY'^ both belong to Sm- We have ElAiiy) \ T^] = 2Q-^ E/ev(l + <^/(e) ± where 
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L'^ bounds the number of open pairs that close /(e) and also f{e') for some other e' E F \ J (we 
3^81. Write = S^^' and = Sl^° . Note that TV^ = TV^ = 2tQn~^l'^V and 



use Lemma 



I -.,0 ^ 2tqn^/2y_ Since AiiTV") = ^^TV we have 



V = V 



E[A,(W^) \Ti]=E [\{V') - \{TV') I 
= 2Q-\V^ + ± lV) - (2 + ASQ-^)TVQ-^ 

= 2Q-\TV^ + TV^ ± v^eyi ± v^Cyo) - (Stn^^/^ ^ 4^essq-'^)TV ± 0{L^v/q) 
= 8tn~^/^W ± (1 + o{l))8t{evi /2 + eyo /2 + e5)t;n"^/2 ± 0{L%/q) 
> (1 + o(l))(/v + 5y - ey//2 - evo/2 -63- 0{LH'^e^))Stvn~'^'^ . 

The same calculation applies if e is a simple stringer (using vr" in place of vr). 
6.2.4 Outer destruction 

Let e = Uxdy be an outer rung, i.e. e is terminal and w{tt) = 2M. For each ah ^ Q let the variable 
^uvab count the number of backward extensions Bj^ that map the last rung of S*^^ to the open pair 
ab. Recall that B-,^ is controllable and note that there can be no bad edge as M is assumed to be 



large, so we can estimate Xuvab using Lemma 5.1 Since Ai{TV'^) = ^q^TV we have 



E[Ai(W^) I J-,] = E[Ai{V') - \{TV') I Ti] 



abeQ ^ 

= 4Q-i(l ± 0{eYe^))Q~^VS - ASQ-^TV ± 0{L\/q) 
> (1 + o(l))(/y +gv- Oieye^) - 0{LH-^e'^))8tvn-^/^ . 

The same calculation applies if e is an outer stringer (using ir" in place of vr) . 
6.2.5 Fan end destruction 

For destruction, it remains to consider the case when e = axOy is a fan end, i.e. vr ends with an 
(M — l)-fan and e is the terminal rung. Let vr* = 7r\xO and TT* — 7t\x^ t)G obtciined. from tt by 
replacing the fan by an or extension. Let = SZv , V* = Sll, V = SZ', V*^ = Sfj' and 
Y*o _ 5^*5^°. "We write V = X^jgy* Ff^^, where Fj^tt denotes the forward extension, i.e. the ^m-i- 
extension from /(ax-idxCty); recall that this is controllable, so we can estimate it using Lemma 
Note that by Lemma |2.3| we have 



5.1 



This implies 



TV*' = Xfic.) ■ 2in-^/2 = (1 ± ej,j2tn-^/\V*f/V^ and 



V*'/QV* = (1 ± (1 + o(l))(ey*/ + ev*))2tn-3/2, and 
V*OiQy* = (1 ± (1 + o{l)){ey.o + ey.))2tn-3/2. 



32 



Since Ai{TV^) = ^^TV we have 

-,TV ± Oieye'tyn-^^^) ± 0{L\/q) 



4 S" 



= (1 ± ^(1 + o(l))(ey. + ev + ey./ + ey.o))8tn-3/2y _ (1 + 65)8*71-2/27-^ 

> ifv +gv-{l + o(l))(i(ey. + ev + ey./ + ey*o) + 65) - ©(eye"^) - 0{LH-^e^))8tvn-^/^ . 
6.2.6 Internal creation 

Suppose that e = a^Oy with x < y < |7r| is an internal stringer edge of vr. Let , V ^ be the 
variables obtained from V, when e is replaced by an open pair. Then TV'^ = V ^ ■ TU and 

To estimate the creation fidelity term consider e' G (^) \ F and let V' be obtained from by 
making e' an edge. Consider the construction of V one vertex at a time in vr-order. We see the same 
extensions as for V'^, except that at one step there is an extra edge incident to the new vertex. This 
reduces the scaling for this vertex by a factor of p = 2tn~^^^. When we use Lemma 



3.8 



we estimate 



to bound 

the number of copies of V , we do not get the full p reduction if < 1. However, V must have 
at least two new vertices in order for creation fidelity to be an issue, and therefore > y^qp~^. 
Note that v'^e^ > y'^qp^^e'^ = 2tq^n > 1. So max{u', 1} < f +6^, and by Lemma 
V' =0{L^\^\e^)v+. 

Since Ai{TV^) = Ai{V-^)TU we have 

^[AiiVV') I J-.] = E [A,{V^) - A,{TV') \ T^] 

= 2Q- ± 0(L^l^le2)) - 2Q-^V+-{1 ± ©(L^l^'e^)) • TU 
= 2Q-\V+ - TV+) ± 0{L^\^\e^q-\+) 
= (1 + 0{eQ))2q-^VV+ ± 0{L^^^^e\'\+) 



(l + o(l))riey+ ±0(t-iL4|^le2) 



6.2.7 Partner creation 



Suppose that e = axOy with x < y = |7r| is the partner edge of vr. We can write tt = tt OV^. We 
estimate the creation fidelity term as for internal creation. Then 

E[A^{VV') I Ti] = E[Ai{V') - AiiTV) \ T,] 

= 2Q-\V+{1 ± 0(L^l^le2)) - TV+), 

= (1 + 0{eQ))2q-^VV+ ± 0{L^\^\e^q-^v+), 
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which gives the same expression as for internal creation. 
6.2.8 Terminal creation 

Finally, suppose that e is the terminal edge of tt. Then vr(|7r|) is E, or Y^, and if 7r(|7r|) = Y'^ 
then 7r(|7r| — 1) / O (otherwise e would be partner). Let V'^ be the variable obtained by changing e 
to an open pair, i.e. replacing Y by X inU = 7r(|7r|). Note that TV~^ = TV ■ ^^zji v'^ = v- ^^^tT- 
We estimate the creation fidelity term as for internal creation. Since Aj(7T^^) = we have 

which gives the same expression as for internal creation. 
6.3 Vairiation equation 

Now we combine all the estimates in this section to show that ZV = VV — eyv forms a super- 
martingale. Recall that E[Aj(PF) | Ti] = ^^^^ ibE[Aj(Py^) | Ti], where the summand is positive 
for an edge and negative for an open pair. There are e{V) edges, each giving a creation term of 
(l + o(l))(t~^ey+ ibO(t~"^L*^l^le^))i;n^'^/^. There are o{V) open pairs, each giving a destruction term 
in which the main term is a self-correction term of (l + o(l)(/v+5'y ±0(eye'^)ibO(L'^t^^e'^))8tt;n~^/^. 

There are additional error terms that come from terminal open pairs, which we denote by eaddi 
the form of this error term depends on the form of the terminal open pair. The constants are 
carefully chosen to ensure that this term does not spoil the supermartingale condition. 

• If the terminal open pair is outer then there is no additional error term at all, i.e. eadd = in 
this case. 

• If the terminal open pair is simple then the error term is 

eadd = (1 + o(l))(ey//2 + eyo/2 + 65). 

Note that 

CyO = Cyl = Cy/9. 

• If the terminal open pair is a fan end, i.e. the last rung in an (M — l)-fan, then the error term 
is 

eadd = (1 + o{l)){l{ev + ev + ey.i + ey*o) + es). 

Note that the sequences defining V* and V* each have M — 1 fewer symbols than vr, but also 
have an additional 'O' or 'E'. Thus 

cv* = cy = cy/9 and Cy*! = Cy*o = cy/81. 
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Observe that vr can have two terminal open pairs (both simple, or one simple and one fan end), and 
we must add the estimates from both to get a worst case bound. Writing eadd = /add + ^^add ™ the 
natural way, we have 

^ ^t-^gv if i < 1 



/add <mfv and 5add < < 



81 

iit>l. 



(4) 

81^ 



We have 



^^^g^3/J ^ ^((1 + om-W^ + Or^L^l^le^)) - (1 + o{lMV){fy + gy) 
+ (1 + o(l))eadd + O(eye^) + 0{LH-^e^). 
As for general extensions we write 

Ai{evv) = ev{t + n"3/2)T;(t + n"^/^) - ey(t)v(t) = {eyv'/v + ey)t;n"^/2 + 0(t^ + r'^)evvn~^, 
where we bound the last term by 0{evvn~^/^). Note that 

e'v > 4:tfv + {■&'/■& - e(F)i-^ + At) ■ 2gv. 

Since V'^ and V have the same length and the same number of copies of O and E, we have cy = Cy+ 
and so ey+ — ey = 2(5y+ — gy)- We also have gy+ < 2tgy (and there is no term if e{V) = 0). 
Thus 

< (1 + 'ey+ + 0(t-iL^l^le2)) - (1 + o(l))(o(F)(/y + - eadd) 
+ 0{eye') + 0{LH-^e') - ((^ - o{V))ey + §) 

< ^ • 2(ffy+ - gy) + o(y)5y + (1 + o(l))eadd - fv/2 
-{^-^ + l)-2gy + O(eye^) + (^(t-^L^I^Ie^) + 0(6^+^^) + o{ey) 



< 9v 



0{V) + ^ - ^ - 1 + 2(1 + o(l))9add - + (1 + o(l))/add 



+ O(eye^) + 0{t-^L^\^\e^) + o{ey+t-^) + o{ey). 

Since —gy'd'/{At'&) dominates for t ^ 1 and —fy/2 dominates otherwise, we have E[Aj(^y) | J^j] < 0. 

Finally, note that the above calculation would have still been valid if we had replaced ey by 
ey/3; we will use this observation for open degree variables to control degree variables in the next 
subsection. 

6.4 Boundedness hypothesis 

We start by recalling we cannot apply our general strategy to vertex degree variables, as the function 
gy{t)v{t) is not approximately non-increasing, so we give a separate argument for these variables. 
Consider the degree variable l^(i') for any vertex u and step i' . For each \ <i <i' , the probability 
that we choose an edge incident to u is 

Xu{i) (l±exi/3)xi / , ,^2 



Q{i) {l±eQ)q 



(l±(l + o(l))exi/3)-. 
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using the observation in the previous subsection that we can bound the relative error in Xu{i) by 
exi/3. Thus we can couple l^(i') to a sum of independent indicator variables with mean 

i' 

(l ± (1 + o(l))ex,(t)/3)2/n = (l ± (1 + o(l))ex,(t')/3)yi(t')- 

i=l 

Since fy^ = fxi and gy^ = gxi (l+t^^)/2 we have ey^ > /2, so if we do not have Yu(i) = (Iztey^ )yi 
then Yu{i) deviates from its mean by more than > L^^n^^'^. By Chernoff bounds, whp this 

does not occur for any vertex u. 

Now recall that for any variable V other than the vertex degrees, it suffices to show Vary < 
{gvv)^/{L^n^/^) and Ny < gyv/L'^. 

Fix TT G Sm-, a non-edge uv, and let V = 5^^. Let (J, F) be the underlying graph pair of vr and 
A = uv. Since gyv > cyL~^(l + t~^'^'^^)ev and e^n~^/^ = (7"^, it suffices to show 

Vare < L'^q-\cv{l + t-<^^)vf . 

We analyse the contributions from each edge e = Uxdy £ F \ F[^] separately. There are two 
cases, according to whether e is an open pair or an edge. 

First suppose axOy = e G T \ J is an open pair (this can be either a rung or a stringer). Let 
(J',F') be obtained from (J, F) by 'gluing a ^-variable on axCty^ as follows. Let 7 be a new vertex, 
V = Vr U {7}, J' = J U {0^,7} and F' = F U {axj,ayj}. Note that this definition depends on 
the order of Ox and Oy. We analyse closures of e by adding the edge corresponding to ax7- Let 
A = {au,av}, A' = Au {ax, 7} and Sm = min^/c_Bcv' S^, where all scalings are with respect to 
(J',F'). Let i?m be the minimal set B with A' C B C V' that achieves = S^- By Lemma 



3.8 



we can bound the probability Pe that ^0,j,r is affected by closing an ordered edge corresponding to 
e by 

as this event is contained in the event that the new edge is contained in an extension to B^- (We 
calculate the impact of closure of e on the one-step variance of V. For many edges we could get 
better bounds for inner open pairs by working instead with DV, noting that B^ is the same for V 
and V~ , but this is not necessary for the bounds we prove here.) Furthermore, we can bound the 
magnitude in the change in V by 

Since S\' = yS^ = yv, the one-step variance in V satisfies 

We can bound S^a from below by accounting for the scalings contributed by vertices in vr order. Each 
vertex contributes at least an additional pqn = y to the scaling. If \Bm \ A\ > 3 it follows that 
Sm>y^> L20|^'l+6y2^ which gives the desired bound on the one-step variance. 

It remains to consider {B^ \A\ < 2. First consider the case {B^ \A\ = 2. We view the extension 
from A to Bm as a sequence of two one- vertex extensions with the extension to 7 taken last. Note 
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that each of these two steps is an extension in the stacking ensemble. If either of these steps is 
not a y^ft-extension then we have S^a > xy > L20|V"|+6y2 ^i^j^jg^g cannot be any Y^-extension as 
Sni contains the open pair Ox^)- This gives the desired bound on the one-step variance. So we 
may assume that both steps are Yafe-extensions. Now we can use stacking variables to estimate pe 
and Ne, since {A,B^) induces the extension 5™^''^ , and = S'^^^^, where vr = 7r(l)7r'. Then 
we can use the better bounds pe < 2Sm/q and A'^e < 25^ /S^a in the above calculation, obtaining 
Vave < 8y^f ^/(gSin). This suffices, as 5m > and cv > L\ 

Finally, consider \B^ \ ^| = 1. Here we have = {a„,ai,,7}, = qn and pe < 2S„Jq- We 
bound Ne by taking the extension from B^ in two steps: first an extension from B^ to a fan $e and 
then a forward extension to the remainder of Sl^^. We can estimate the latter with a proportional 
error of 0(ey). It remains to estimate the number of extensions to If we consider any time t for 
which <I>e is controllable then we have Nf> < 2yv/ S^a, so again we can improve the above calculation 
to obtain Vave < Sy'^v^ / {qSm)- This suffices as = qn> y'^/2 and cy > ■ On the other hand, if 
the fan $e is not controllable at time t then either <I>e is an extension to a single vertex or g < n*^"^/^. 
If g < n^~^/^ we have = qn > L^'^l^'l+^y^, which suffices as above. If <&e is an extension to a 
single vertex we estimate N^, < ■ vy/Sm- This gives Var^ < 8L^v'^/q, which suffices as cy > L^^. 

Now suppose that e G J is an edge and e = UxOy where x < y. Let A' = AU {ax, ay} and 
5*111 = niinyi'cBcv S^, where all scalings are with respect to (J \ e, F). Let i?m be the minimal set B 
with A' CB that achieves = Sm- Now we have pe < L'^^^'^Sm/q and iVg < L^^^'^S\/Sm- It 
follows that the one-step variance in V satisfies 

Fare < PeN! < L'^\^'\{SJ q){Sl/ S^f = L^'\^\qp''vf / {qS^). 

Again we calculate the scaling S^a one vertex at a time. Each vertex contributes at least an additional 
pqn = y to the scaling, and Oy contributes at least q'^n = x, since the edge a^ay is switched to an 
open pair in (J \ e, F). If \Bm \ ^| > 2 we have 5m > xy, so 

Var,<y-^L^^\'''\q-\t-\f, 

which is sufficient, as e G J implies e(y) > 1. It remains to consider the case \B^ \ A\ = 1, i.e. 
Sm = A' and e creates the first y-extension of vr (using the fact that V is not a vertex degree 
variable). Then pe < 2x/q and A'^e = S^^^_^ < 2v/y, so Vave < 8q^^{t^^v)'^, which suffices as 
cv > L\ 

Thus we have verified the trend and boundedness hypotheses for stacking variables. By Freed- 



man's inequality, this completes the proof of Theorem 6.1, and so of the lower bound in Theorem 



7 Independence number and upper bound 



After some establishing some preliminary facts in subsection 7.1 we will prove Theorem 1.2 



m 



Section 7.2 Then we will apply a similar and easier argument in Section 7.3 to obtain the upper 
bound that completes the proof of Theorem |1.1[ 
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7.1 Preliminaries 



We start with an observation that will be used many times in this section to estimate the one-step 
variances due to destruction. Suppose F is a variable, e = uv is an open pair in V, and we want to 
estimate the one-step variance Varg due to destruction of e. Suppose that at most configurations 
in V can be closed in any given step. Consider the bipartite graph H with parts {A,B), where 
A = V IS the set counted hy V, B = Q is the set of ordered open pairs, and f £ A is adjacent to 
b £ B ii selecting b as an edge closes /(e). By assumption dnib) < N for all b £ B. We also have 
e{H) = 2 Zfeviyjiuv) + Yfivu)) = (1 + o{l))AyV. Then 

Vare < Q'^ ^^^(^)^ < (1 + o{l))q'^e{H)N = (1 + o{l))8tn-^/^ NV. (5) 

Next we prove some lemmas on counting open pairs. For any set S let Qsit) be the number of 
ordered open pairs in S at time t. For any sets A, B let QAB{t) be the number of open pairs ab with 
a £ A, b £ B a,t time t. 

Lemma 7.1. Whp for any set S of size s we have the following estimates. 

(i) Suppose that s > n}!^ , ip > n^^^^ , h < L^^'^ip'^qs and any vertex has at most h neighbours in 
S. Then Qs = {l±ip)qs'^. 

(a) If '4}> n~^/^ and s > V~^v^ then = (1 ± %l))qs^ . 

(Hi) If s < L^'^y/n then Qs < L^'^sq^/n. 

Proof. First consider statements (i) and (ii). We use critical window analysis for t > n~^'^ to prove 
the bound Qs = (1 i eo)qs'^, where eo = {I + t/ L)'ijj/2. This suffices as eo < V'- We use the window 
[(1 + eo — 9o)qs'^, (1 + 60)95^]) where go = ip/i^OL^). First we use coupling to the Erdos-Renyi 
process to show that Qs does not enter the critical window at t = n^^'^. This follows from the trivial 
upper bound Qs < s^, and the lower bound Qs > — 5n^''^s, obtained by subtracting the number 
of paths of length 2 starting in S in the random graph. 

Next we establish the trend hypothesis that ZQs = Qs — Qs'^ — ^oQs'^ is a supermartingale while 
Qs is in its critical window. The expected change in Qs is 



nAiQs I -Fi] = -2Q-' ^ {Yab + Yha) = -8tn-'^'{l + 0(ey))Qs. 

abeQs 

We also note that Ai{qs^) = (-Stn^^/s + 0{L^n-^))qs'^ and Aj(eogs^) = (1 + o{l)){{L + t)-'^ 
8t)n~^/'^eoqs'^ ■ Using g^^'j^^^ > 2go and ey < n~^/^ = o{go) by Lemma 



3.5 



we have 



n^^ZQs I H < -(1 + o(l))8tn-3/2gs2 . (eo -go- 0(ey) + (^^(^^ - l)eo) < 0. 

For the boundedness hypothesis, we take a union bound over S and apply Freedman's inequality. 
Write No and Varo for the maximum one-step change and variance of Qs- Since go = ip/{40L'^), it 
suffices to show that No < L~^^tp'^qs, as by ^ this also implies Varo < L~'^n~^/'^ {L~'^ijjqs'^)'^ . 
This holds by our assumptions, using No < h for (i) or No = 0{y) for (ii). 
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It remains to prove (iii). In this case we write eo = (1 + t j V)L}^^ sq^fn jl. Note that the bound 
is trivial for t < 1, as s < LP'^Jn implies Qs < < eo- For t > 1 we use critical window analysis 
with the window [eo — go,^o], where go = eo/(40-L^). When Qs is in the critical window we 
estimate K[AiQs \ J^i] < —(1 + o(l))8tn~^/^(eo — go)- We write ZQs = Qs — and note that 

= (1 + o(l))((L + t)^^ — 8t)eo- Thus we obtain the trend hypothesis 

n^iZQs I H < -(1 + o(l))8tn-3/2 . (eo - go + - l)eo) < 0. 

For the boundedness hypothesis we use No ^ '^y ^ L^^s^^go, which implies Varo < gQn~^/'^ , 

so we can apply Freedman's inequality. □ 

We also need a bipartite version of Lemma |7.1[ i). The proof is essentially the same, so we omit 

it. 

Lemma 7.2. Suppose r,s > n^^^, ip > n~^/^ and h < L~^^'ip'^qmm{r, s} . Then whp we have 
Qrs = (1 =t 'ip)qrs for any sets R, S of respective sizes r, s such that any vertex that has a neighbor 
in one of these sets has at most h neighbors in the other. 

Next we establish some density estimates. For a set 5, let Es denote the number of edges of 

G{t max) in "S"- 

Lemma 7.3. Whp for any set S of size s 

(i) if s> L^'^y/n then Es < L^W^/'^s^, 

(a) if s < L^"^^ then Es < L^^s- 

Proof. For (i), we estimate the probability that some such S spans M := L^n'^^'^s'^ edges, taking a 
union bound over S and the steps at which the edges are chosen, for which there are (") (]^) choices. 
For a specified step at time t, the probability of choosing an edge in 5 is Qs{t)/Q{t) = (l+o(l))s^/n^, 
using Lemma |7.1[ ii) . Thus the failure probability po satisfies 

"<:)(m)«'+°«''^''"''"- 

Then logpo < O(logn) + (1 + o{l))Ms-^ log ^ < -l?n'^l'^s < -L^- 

For (ii), we estimate the probability of choosing an edge in 5 as Qsit)/Q{t) < 2L^^sn~^/^ by 
Lemma |7.1[ iii). Then 

so logpo < O(logn) + Lis log <-L^^. □ 

We can deduce a bound on the number of vertices of large degree in a given set. Let Dd{S) be the 
set of vertices that have degree at least d in S- 

Lemma 7.4. Whp for any set S of size s 
(i) ifs> and d > SL^n-^/^s then \Dd{S)\ < SL'^n'^'^s'^ /d, 
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(ii) ifs< Li2^ and d > then \DdiS)\ < AL^^s/d. 



Proof. For (i), suppose on the contrary that there is T C Dj^[S) of size SL'^n-^/^s'^/d. Then 5ur 
is a set of size at most 2s that spans at least d\T\/2 > L^n~^/^(2s)'^ edges, which contradicts Lemma 



Y^i). Similarly, for (ii), if there is T C Dd{S) of size AL^^s/d then |5 U T| < 2s < 2L^2^ 
EsuT > d\T\/2 > 2L^^s > L'^^\S U T\ edges, which contradicts Lemma [7^ □ 



Next we introduce absolute constants < a < 7 < /3. We emphasize that these constants are 
chosen in the order listed; that is, a is chosen to be sufficiently large, 7 is chosen to be sufficiently 
large relative to a and /? is chosen to be sufficiently large relative to a, 7. (In an attempt to make clear 
the role these constants play, we do not substitute actual values. But for concreteness we note that 
they can take the values a = 25,7 = 50, /3 = 600.) These constants will control the polylogarithmic 
factors in our arguments. When these factors are unimportant we will use 'tilde' notation: we write 
/(n) = 0{g{n)) and g{n) = (l{f{n)) if f{n) < {\ogn)'^g{n) for some absolute constant A. 

Given a vertex x and a set H, let WxH denote the number of ordered triples (a, 6, c) of vertices 
such that ax is open, {6, c} C and a6, ac are edges. 

Lemma 7.5. Whp for every set H of size h with L"' < h < L~^y/n with the property that for any 
vertex a 

(i) there are at most edges ah with b £ H, and 
(ii) there are at most 2x = 2(f'n open pairs ab with b ^ H , 
we have W^h < L^°^hqy/n for all x ^ H. 

Proof. We start with some definitions. We say that a vertex a is heavy with respect to H at time t 
if at least q\/nL~'~^ pairs ah with h £ H are open. We say that a is obese with respect to H at time 
t if at least q^/nL'^ pairs ah with b £ H are open. For any obese vertex a we declare some subset of 
the open pairs ab with b G H inactive so that the active open degree into H is [q^/nL'^\ . We stress 
that the status of an open pair as active or inactive can change back and forth in the course of the 
process, but once a pair is chosen as an edge its status as active or inactive remains the same for the 
rest of the process. For j G {0, 1, 2} let W^^ denote the number of ordered triples (a, b, c) of vertices 
such that ax is open, {6, c} C H, the pairs ab and ac are both active, and their status depends on j: 
if j = then both are open, if j = 2 then both are edges, and if j = 1 then ab is open and ac is an 
edge. Thus W^j^ has the same definition as WxH-, with the additional condition that ah and ac are 
active at the steps they are chosen as edges. 

First we show that there is a negligible difference between WxH and W^^j , so it suffices to bound 
the latter. Let O be the set of vertices that are obese with respect to H. We claim that 

\0\ < 2/iLi3-7_ 

For suppose not, and consider O' O of size 2hL^^^'^ . Then \HUO'\ < 2h and QhuO' ^ 2L^^hqy/n. 



This contradicts Lemma 7.1 iii), so \0\ < /iL^^~'^. Applying Lemma 7.1 iii) again, we bound the 
number of open pairs in U O by Qhuo < 'ihL^^q^/n/2. Thus the probability that we choose an 
edge between an obese vertex and H is at most 2hL^^n~^^'^ . By coupling with a binomial variable 
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and taking a union bound over H, whp for all H we bound the number of edges in H L) O hy 
EhuO < hL^^. By assumption (i) this gives WxH — < hL^^, which is negligible by comparison 
with the desired bound on WxH- 

For the remainder of the proof, we use critical window analysis to prove the bounds 

< Wo ■■= L^'^^'^hxqy/n 
WxH <W2:= L~'^hqy/n. 

For W^^ we use the critical windows [ej — gj,ej], where Cj = (1 + t/L)wj/2 and gj = Wj/{40L'^). 
This suffices as ej < Wj for j = 0, 1 and 62 + hL^^ < W2- 

First we use coupling to the Erdos-Renyi process to show that the variables do not enter the 
critical windows for n~^/^ < t < 1. When j = we use the trivial bound W^fj < nh? <C wq (since /3 
is large compared with a). When j = 1 we bound W^jj by picking {b, c} C H then a neighbour of c. 
Since the random graph has degrees (1 + o(l))pn = 0{y) for t < 1 we obtain < 0{y)h^ ^ wi. 
When J = 2 we bound W^^ by picking {6, c} C. H then a common neighbour, for which there are at 
most 0{L'^) choices in the Erdos-Renyi process, so W^^ = 0{L'^h?) <C W2- 

Next we prove the trend hypothesis, i.e. that ZW^jj = W^^j — ej is a supermartingale while W^jj 
is in its critical window. We analyse the contributions to K[AiZW^^ \ J^i] separately according to 
each of the pairs ax, ah, ac. 

Note that we cannot count the contribution of the destruction of ab or ac when a is obese because 
we might only change the status of some inactive pair when such an active pair ab is closed. Let Aa 
denote the set of 6 G such that ab is open and active. The contribution to E[AiW°^] due to the 
closure of pair ab or ac where a is obese is at most 

a&O beAa 

Similarly, the contributions to E[AjW^j:^] due to the closure of pair ac where a is obese is at most 
2^"^ E E ^^'^b + ^ba)L^ < ^yq'^L^ ■ 2hL^^-^ ■ qv>l'^V< < Stn-^/^dL-^. 

aeO h^Aa 

We will see shortly that these contributions are negligible. When wc calculate the expected change 
due to destruction of ah or ac below we restrict our attention to vertices a that are not obese. 

In the calculation of the expected change in ZW^jj = W^^ — Cj we write Aj(ej) = (1 + 
o(l))e^n-3/2, noting that Cg > ((L+t)~^-24i)eo, e[ > (t-^-16t)ei, 63 > ((L+t)"^ -8i)e2. For each 
open pair a/? we have a destruction term of 2Q~^ J^f^w^ (^/("/S) ^fifioc)) ^ (1 + o(l))8tn~^/^(ej — 
gj). This gives self-correction against a corresponding 8tn~^/^ej term in Ai(ej). For each edge 
we have a creation term of 2Q~^W^]j^ < (1 + o{l))2q~^ej-i. Note that 2q~^eo = f^rf^/'^ei and 
2q~^ei = 2L~^n~^/^e2. Using g^^^'+t) — obtain 

n^iZW^H I H < -(1 + o{l))8tn-^/^ ■ (3(eo - go) + {st(hr) " 3)eo) < 0. 
K[AiZW^H I < -(1 + o(l))8tn-3/' • (2(ei - 51) - ^e-, + - 2)ei) < 0. 
E[A,ZW^H I J'^] < -(1 + o(l))8tn-3/' • (e2 - 52 - 4^3^62 + - 1)62) < 0. 
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It remains to show the boundedness hypothesis. Note that since we can restrict our attention 
to i > 1 the functions gj are approximately decreasing. For each pair e in W^jj let Ne bound the 
one-step change and Var^ the one-step variance due to e. To apply Freedman's inequality, since 
9j = '"^i/(40i^), it suffices to show A'^^ < Wj/{hL^) and Vave < w'j / {hL^-n^'/'^), where A'^^ is the 
maximum positive one-step change in ZW^jj. In some cases we will show the stronger statement 
A'^e < Wj / (hL^^), which clearly gives the desired bound on Nj^, and implies the desired bound for 
Vare by ([sj). 

First we note that the required bounds for creation hold trivially as A'^e < yL'~' <^ wi/{hL^^) for 
W^H and Ne<L^<^ W2/{hL^^) for W^jj. 

For destruction we obtain negative changes in ZW^^, so we only need to bound Var^. First we 
introduce some additional definitions. Let T = T^h be the set of heavy vertices a such that xa is 
open. Appealing to the control we maintain on Xuv, we have 

|r| < 2hx/{qy/^L-"<) = 2hL^qy/^. 

Let U be the set of vertices z such that zx is open and z has at least qy/nL~^^ neighbors in T. By 
Lemma 17.41 we have 

fs/iL^^+i^ if 2/iLT-i2^ < 1, 
)32h'^L^"'+'^q otherwise. 

Here we used qy/nL~^"' > 4L^^ and qy/nL^^"' > SL'^n^^^'^ ■ 2hL'^ qy/n, which follows from the fact 



\U\< 



that we choose /3 to be large relative to 7, to get the lower bounds on d required for Lemma 7.4 

Now consider destruction for the variables W^^^ for j = 0, 1. We write AiW^^ = AjVi -|- AjV2, 
where AjVi accounts for the change in V = that comes from the choice of an edge xz where 
z & U, and AjV2 accounts for the rest. For AjV2 the bound on Vare will follow from the bound on 
A'e and ([s]). The contribution to A'^e from the closure of pairs ab or ac is clearly sufficiently small. 
Next consider the contribution from closure of pairs xa where a is not heavy. For j = this is 
{2y){q^L-^f < 2qy/^L^-'^^ x , which suffices as Wo/{hL^^) = L'"'-^^ xqy/n is much larger as 7 
is large relative to a. For j = 1, appealing to condition (i) in the statement of the Lemma, the 
contribution is at most {2y){qy/nL~^)L'^, which suffices as wi/{hL}^) = L~°'~^'^yqy/n and 7 is large 
relative to a. Now consider the contribution from the closure of pairs xa where a is heavy (but 
we do not select xu with u £ U). Since we only consider active pairs and t > 1, this is at most 
qy/^L-^^{qy/^L^f = L'^xqy/^ < wo/{hL^^) for j = 0, or q^L'^^ {q^L^)L^ < wi/{hL^^) for 
j = 1- Thus we have the required bound on Ng for Ajl^. 

For j = 0, 1 it remains to bound Var^ for AjVi. The probability that an edge xz with z £ U 
is chosen is at most 2\U\/q, and the resulting change in W^j^ is at most {2y){qy/nL^Y 3 — 0; 
or {2y){qy/nL'~^)L'^ for j = 1. Suppose first that 2hL'^~^'^q < 1. Then for j = we have Var^ < 
l6hL^"'+^^q-^{2yf{qy/nL'y)^ = d{hq^n), which suffices as u;g/(/iL%3/2^ ^ n{hfn^/^). Also, for 
j = 1 we have Vare < l&hL'^^+^^q-^(2yf{qy/^L^fL^ = d{hLf), which suffices as wl/ {hL^ri^''^) = 
0(/ig^n^/^) (recalling that we restrict our attention to t > 1). Now suppose 2hL'^^^^q > 1. Then for 
J = we have 

Vare < 64/i2L5T+2n-2(2y)2(^y^LT)4 < 256h'^L^^+^fn, 
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and for j = 1 we have 

As (3 is chosen to be large relative to a, 7 these bounds suffice. 

It remains to bound Varf, for destruction of W^^. Let W be the set of vertices that are open to 
X and have at least two neighbors in H. By our bound on Y^v we have \W\ < 2yh. Let U' be the 



set of vertices that are open to x and have at least yL neighbors in W . Note that by Lemma 7.4 
we have 

yi^h^yrr^l'^Ui^'^ otherwise. 
Here we used yL~'^ > 4L^^ and yL'"' > 8L^n~^/^ • 2yh, which follows from the fact that /3 is 



chosen to be large relative to 7, to get the lower bound on d required for Lemma 7.4 We write 
AiW^jj = AiVi + AiV2, where AjVi accounts for the change in W^jj- that comes from the choice of an 
edge xz where z € U' , and AjV2 accounts for the rest. To bound Vare for AiV2, note that by condition 
(ii) in the statement of the Lemma we have A'^g < yL^^"' < W2/{hL^^). For AiVi, suppose first that 
2yh < Then Vare < 8hL'y+^^q~^{2y)'^L^^ = d{hyn-^/'^), which suffices as wl/ {hL^n^/"^) = 

f^(/iy2n-3/2). On the other hand, if 2yh > L^^^ then Vare < 32h^yn~^/^L"'+^q-\2yfL^^ < 
y^n which also suffices because /3 is chosen to be large relative to 0,7. □ 



7.2 Proof of Theorem ICT 

Let e' = 3e and k = {I + e')^/2nlogn. We show whp a{G) < k. Note that a{G) < a{G{tmax)), so it 
suffices to bound a{G{tmax))- We need to estimate the probability that there is an independent set 
K of size k. We will take a union bound over all such sets K together with information about how 
neighborhoods in G{tmax) intersect K. We define a sequence of vertices xi, . . . , x^, where each x^ is 
chosen to maximise the number of neighbours in K that are not also neighbors of some Xj for j < ^. 
More precisely, the ^th 'hole' is Hi^ = {N(xe) \ U^'<^A^(x£/)) n K, where X£ is chosen to maximise 
hi = \H£\. We stop the sequence if there are no vertices that give more than L'^°' new neighbours 
in K. Note that xg ^ K for ^ G [z], as K is independent. We say that a hole is 'large' if has size 
more than L"f^ ^/n. We let Za be the set of £ such that is large, Zb = [z] \ Za, A = U^g^^if^, 
B = U^g^gi/^, C = K\{AU B). For £ G Zb we write Hg = {vgj : j £ [hi]}, where xv^j is selected at 
step iij, and iej is increasing in j. For i G Za we specify the entire neighbourhood of xi in G{tmax)'- 
we write dg = \N{x()\ and N{xi) = {vij : j G [di]}, where xv£j is selected at step igj, and iij is 
increasing in j. We will estimate IP(f), where £ is the event that there is an independent set K with 
some fixed choices of z, xg, h^ for I G [2:] and dg for (. G Za- We will refer to these choices of hole 
sizes, vertices with large neighborhoods in K and vertex degrees as the initial data that defines 8. 
Note that by Lemma 7.4 we can assume 



\Za\< 8L^^+^ and z < AL^^-^'^k. 
Also, using our bounds on vertex degree variables, for i G Za, j G [dg] we can assume 

iej = jn/2 ± n^/^-^/s and di = d± n^/^-e/s^ 
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Now in addition to the initial data, we fix the independent set K and the specific edges and 
appearance times V£.,iij for i £ ZA,j G [di] and vgj,iij for i G ZB,j G [hi]. We let £k be the 
event K is independent and all the specified edges appear at the specified steps of the process. Note 
that £ is a union of events of the form £k- To estimate the probability of the event £k, for each 
step i we need to estimate the probability that the selected edge is compatible with £k, conditional 
on the history of the process. We say i is a selection step if i is one of iij for £ £ Za, j G [de] or 
£ £ Zb, j G [hi]; then the selected edge is specified by £k, so the required probability is simply 
2/<3 = (1 i 2eQ)2q~^. For other i, the required probability is 1 — Ni/Q, where Ni is the number of 
ordered open pairs that cannot be selected at step i when £k occurs. If i = iij is a selection step 
write Ni = and tij = t. Then we estimate 

d£ he m 

nSK) < n ± 2eQ)2g(t,,)-i • n 11(1 ± 2eQ)2g(%)-i • - N,/Q). (6) 

To estimate Ni, we classify open pairs that cannot be selected at step i as follows. 

• Let NiAi be the number of ordered open pairs of the form Vijvgj' for some i £ Za, £ [de]. 

• Let NiAo be the number of ordered open pairs of the form X£y or yx£ where i £ Za and 
y ^ KU {xi, . . .,Xz}. 

• Let NiB be the number of ordered open pairs ab such that selecting Cj = ab would close an 
open pair of the form xiv^j for i £ Zb, j £ [he\. 

• Let NiK be the number of ordered open pairs in K that are not contained within any hole. 

We write Ni > NiAi + NiAo + NiB + Nix — Nio, where Nio corrects for any overcounted pairs. 

To estimate NiAi, for i £ Za define jc = to be the value of j £ [di] such that iij < i < i^Q+i), 
i.e. je edges have been selected at xe. Let Se = {vej}'^Lje+i aiid si = jS^I = di — je. Note that the 
number of ordered open pairs vejVgj' with j > ji, f < ji is YjveSi '^^vxi = (1 ± eY)2yse- Also, by 



the bound on codegrees, any vertex has at most neighbours in S^. Then by Lemma 7.1 i) whp 
QS), = (1 ±n~^/^)^s| if si > n}-!^ and qsi > r?'^l'°L^^. Since q > rT^^'^^^ this holds for sn > 
so we can write Qsi > (1 — n~^^^)qse{se — The bound on co-degrees also implies that the 

number of open pairs that can be counted by more than one £ £ [Za[ is at most (|Z^|L^)^ = 0(1), 
which is negligible. Thus 

N^Ai > (1 - ^ (2yse + qse{se - n'/'-'/')) = {2yse + qsj) - d{qn^-'">). 

e&ZA i&Za 

Using control on the open degree of vertices, we can also estimate NiAo by 

NiAo > 2[Za[ (qn{l - n'^l^) -k-^ > 2[ZA[qn{l - n"^/^) (7) 



To estimate NiB, we say an open pair that closes Xf^v^j for some £ £ Zb, j £ [hii\ is outer if it 
contains X£ or inner otherwise. We write N^b = NiBi + NiBo, where an ordered pair counted by NiB 
is counted by NiBi if it is inner or NiBo if it is outer. Let Si be the set of vij with j £ [h(\ such 
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that xe,viij is still open, and write si = \S(\. Each vij in Si contributes 2Yy^.xi = (1 ± eY)2y to NiBi 
and 2Yxfvej = (1 i eY)2y to NiBo] however, we need to account for open pairs that may be counted 
by more than one pair x^Vfj. For outer pairs, this may occur for xiv^j and x^viji with i E Zb and 
G Si. Assuming that > L", the number of such overcounted pairs is Wx^Si < L^^'snqy/n 
by Lemma |7.5[ (Note that we apply the upper bound on the size of a hole with £ G Zb here.) 
Summing over i G Zb^ using \Zb\ < z < 4L^^~^"A; and ^^^^^ S£ < k we obtain 

NiBo > (1 - ey)2y ^ (s^ " " L-°s^gV^ > 2y ^ - L'^-'^kq^. 

ieZB ieZB leZB 

Next note that an inner pair for x^v^j contains v^j. Furthermore, it cannot be of the form V£jV£'ji 
for some i' £ Zb with I' ^ i. For this would require the edges xevi'ji and xi/Vij, which cannot both 
exist by the hole construction procedure. Thus there is no overcounting for inner open pairs, and we 
obtain NiBi ^ (1 — ^Y)2yYle:^ZB finally note that inner open pairs have at least one vertex 

in K while outer open pairs have no vertex in K (since we may assume K is an independent set), 
thus the collections of open pairs counted by NiBi and N^bo do not intersect. In total 

NiB > (1 - ey)4y ^ - L^'-'^kq^ = 4y ^ - 0{L-^qn), (8) 

i&Zg I&Zb 

provided a is sufficiently large. 

Now we turn to an estimate for Nm ■ We begin by defining a partition of ^ U that plays a key 
role in this discussion. We write 

h* = h*{i) = min{n2/5,L-50g^}, 



and let t £ [z + 1] be such that /i£ > /i* for 1 < £ < £* and he < h* for i* < £ < z. Let 
Ji = Jiii) = ^e<e*H£ and J2 = ^2(^) = '^e>e*H£. Note that any vertex has degree at most 
h* in J2, by the hole construction procedure. Let NiKB be the set of ordered pairs counted by 
NiK with first vertex in J2 and let Nixc be the set of ordered pairs counted by Nix with first 



vertex in C. By Lemma 7.1 i) whp Qj^ = (1 it L ^)^|J2p if q\J2\ ^ L'^^h*, so we can write 



Qj^ > {I — L ^)g| J2|(| J2I — L ^^^/n). Let Q'j be the number of ordered open pairs in J2 that are 



not contained within any hole. Then 



Qj, > Q.h - h*\J2\ > (1 - L-^)g| J2|(| J2I - 2L-30V^). 



Next let T be the set of vertices that have at least Lp'^h* neighbours in Ji. We can assume \T\ < 



4L-^\Ji\/h* < 6L-^^/n/h* by Lemma 7.4 It follows that |A^(r)n J2I < 61'^^. Applying Lemma 



7.2 



withi?= Ji and5 = = J2\N{T), whp Qj^j/ = {l±L-^)q\Ji\\J^\ if gmin{| Ji|, | J^|} > L^^/i*, 
so we can write 

Qj,j^ > (1 - L-'m\Ji\ - L-^Mm - 7L-^v^). 

We can apply the same argument to open pairs between J2 and C. Let T' be the set of vertices 
that have at least L^''"'"^" neighbours in J2. We can assume \T'\ < 4L^^~^"| J2I < 6L~^~^"-^/n by 
Lemma 7.4 Since any vertex has at most L^" neighbours in C, it follows that \N{T')riC\ < %L~'^^Jn. 



Applying Lemma 7.2 with R = wid S = C = C\N{T'), whp Qj^c' = (1 ± L~^)q\J2\\C'\ if 



45 



gmin{| J2I, \C'\} > so we can write Qj^c' > (1 - L-^)q{\J2\ - L-'^,/n){\C\ - 7L-^^/E). In 

total we obtain 

NiKB > Q'j, + Qj,j^ + Qj,C' >qk\J2\- 0{L-'^qn). 
Also, applying Lemma 7.1 to C we have Qc > (1 — i^~^)|C|(|C| — L^^-^/n), so 



NiKC > Qk\c,C' +Qc> qk\C\ - 0{L~^qn). 

(For Qk\c,C' apply the same method as for Qj^c'-) When q < n^^/^ these two estimates will 
suffice: we have 

^<n-i/6 ^ N,K> NiKB + NiKC >qk{\C\ + \J2\)- 0{L-^qn) 

= qk(\C\+ J2 -OiL-^qn). (9) 

V e=e*+i / 

When q > n~^^^ we improve this estimate as follows. Let z' be such that > r?l^ for i < z' and 
he, < n^l^ otherwise. Note that z' < t. We write NiK > T,e=iNiKHe + NiKB + NiKC J where an 
open ordered pair counted by NiK is counted by NiK Hi , NiKB or NiKc according as its first vertex 
is in Hi, B \ U|^j^i?£ or C. For i £ [z'] we claim that 

q > n-V6 ^ NiKH, > (1 - L-')qhek/2. (10) 



To see this, we first apply Lemma 7.2 for each i' such that hi' > n^l^ to R = Hi \ N{xii) and 
S = He \ N{xe). This is valid by the codegree bound, as each has size at least n^^^ — L^, and 
any vertex with a neighbour in one set has at most < L~'^^q{n^/^ — L'^) neighbours in the 
other. Thus we obtain QhiH., = (1 i L~^)qhihi' . Then we apply Lemma 7.2 with R = Hi and 



S = K' = K \ which is valid as any vertex with a neighbour in one set has at most 

^1/5 < ^-20 neighbours in the other (by the hole construction procedure). Thus we obtain 



QhcK' > {1- L ^)qhi{\K'\ - r?l^). Since k-hi- r?l^ > k/2 this proves (10). Using the estimates 
from above for Nikc and NiKB we have 

q>n-^''' N,K>qk(\C\+f2he/2+ ^ hA - OiL-^qn). (11) 

V e=i £=£'■+1 J 

To estimate the overcount Nio , first note that there is no overcounting between NiAo + NiSo and 
NiBi + NiK', pairs counted by the former do not intersect K while pairs counted by the latter do 
intersect K. There is also no overcounted pair between Nij^^ and NiBo^ as such a pair would have 
to contain xi with £ G Zb to lie in NiSo and no such pair lies in NiAo- There is no overcounted pair 
between NiBi and NiAi as the hole construction procedure ensures that no vertex in a hole Hi with 
t G Zb is also a neighbor of some vertex xii such that G Za- It follows from the co-degree bound 
that the overcount between NiAi and NiK is 0(n^/2). To bound the overcount between NiAi and 
NiAo, note that this is determined by naming vertices xi,xi' such that G Za and a vertex b in 
the (final) neighborhood of xi. The number of choices for these is at most \ZA\'^d = 0(n^/^). Also, 
the overcount between NiAi and NiBo is determined by naming a vertex b £ B, a, vertex xi such 
that £ G Za, and a vertex c that is in the (final) common neighborhood of xi and b. The number of 
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choices is bounded by k\ZA\L^ = 0{n^^^). This leaves the overcount between Nix and NiSi, which 
is the most significant potential source of overcounting. 

In order to bound the overcount between Nix and NiSi we introduce a definition. We say that 
a hole Hf with £ S Zb is 'black' if xi has more than L^'^hf neighbours in K. Let BH be the set 
of vertices that belong to black holes, and let XH be the corresponding set of vertices xg. Note 



that Lemma 7.3 li) implies \BH\ < L^^^k. The contribution to NiBi of pairs that would close edges 
incident to black holes is at most 3y\BH\ < 3L~^^yk < SL^^^qkn^^"^ . Now consider overcounted 
pairs that would close pairs that are not incident to black holes. Such a pair has the form vijVi'ji 
where xiVi'j' is an edge, so i' < i by the hole construction procedure. If hg > r?l^ then by the 
codegree bound there are at most n~'^l'^k ■ < n^^^ such edges xiVi'j' with the fixed xi. These are 
only counted in our estimate for NiK while q > n^^/^, so the overcount for such a hole is at most 
hiU^^^ < hiqn?^^. For holes Hg such that hg < n^/^, recall that open pairs between Hi and H^r are 
only counted in our estimate for if hi < h* < L~'^^qy/n. Since Hg is not black, the number of 



choices for vgiji is at most L^^hg < L ^^q^/n. The total contribution of such pairs to NiB is at most 
L~^^hiqy/n. Thus 

Nio < 5L-^^qn^/'^k = 0{L-'^n). (12) 

Next we substitute l — Ni/Q < exp —{l — 2eQ)q~^{NiAi + NiAo + NiB + — Nio) in the estimate 
for F{£k) given in ^ to obtain 

-logP(£:x) > SM-TA+SB-TB+SAo+SK-So + {'^logn-log2) j ^ + |5| | -©(n^/^--), (13) 



where 



S^ = J2N^^^<1~' ior ^ie{Ai,Ao,B,K,0}, 

i=l 

de he 
eeZA j=i eeZg j=i 



2 



We proceed to show that the contributions from Sai — Ta and Sb — Tb are negligible. The remaining 
terms will be used to balance the number of events in our union bound calculation. 
To estimate Sai, note that 

m de I'lj 

i=l £gZa (-^Za j=l i=ie(j-i) 

de ^to 

= EEE4--^ 

= EE24-EE2^^.--^/' = ^-o(n-) 



tdZA j = i t&ZA i=i 
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Using iij = jn/2 ± n^/^ '^/'^ and dg = 2i„iax/n it v}^"^ '^/^ we have 

i=l IdZA (.dZA J=l i=H{j~i) 

2ima:./n-2nl/2-E/3 

> \Za\ (2n)-^j2 
i=i 

Similarly, we have 

teZyl j = l i = l 

<|Z^| /(2n)-i + 0(ni/2-^/3). 

i=i 

Putting everything together we have Ta — Sa < 0{n^^'^~'^^^). Similarly, for Sb we have 

m hi ''■tj 

= E EE8^^"' = ^^-o(™~')- 

^6Zj3 j=l i=l 

This gives Tb-Sb< OiL-^n^/^). 

We are now ready for the union bound bound calculation. Recall that we have fixed the initial 
data that defines the event £; that is, we have specified z, the collection of vertices xi,X2, ■ ■ ■ ,Xz, 
the collection of hole size hi, . . . ,hz and the degrees of vertices X£ for i € Za- The number of choices 
for the data that defines an event £k is at most 



(Note that we name the vertices in A U i? by specifying the vertices in holes.) To estimate ¥{£) we 
apply (13) to each such choice of £k, using dg = 2imax/n±n^^'^~'^^'^ and (^^^ < exp O (log log n)/i£ for 



i £ Za, that Ta-Sa, Tb-Sb and So are at most 0{L-'^n^^^), and Sao > 2|Za| w/n-0(nV2-£/5) 
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by ([7|. This gives 



IP(^)< n (^-7^) exp{-2w/n + 0(loglogn)M 



ne 2m\ M / n \ _c,^. , n(-/-,-2„i/2 




< exp I hilogiV^/he) + \C\ logn/2 - Sk + 0(loglogn)fc 

It remains to show that Sk is sufficiently large. Note that contribution from pairs with first vertex 
in C is \C\kimax/n''^ and we have an error term bounded by 0(L^^n^/^). For each hole Hi C AUB we 
consider the contribution from open pairs with first vertex in H^. If hi > n?^^ (i.e. i < z') we obtain 



a term qkhe/2 in (11) while q > n i.e. up to time ti = logra. The resulting contribution to 



Sk is tin^/^ • khi/2'in?. If hi < ri}l^ we obtain a term qkhi in (|9|) and (11) while q > L^^hi/ y/n, i.e. 



up to time ti = ^ylog jruj^- The resulting contribution to Sk is tin^/'^ ■ khi/v? . Thus we have 



Note that hi > 1 implies y logn) • log(^) > log j^. RecaUing that A; = (1 + e')\/2nlogn and 
m = y/{l/2 — e) logn • and e' = 3e we have 

/ \ 1 \ ^ ' / 

iogP(0<-X]^M 77^-Y^)iog™- /i4^^^V^°^€ "^'^1^°^''/^ + ^^^°^^°^''^^ 



< -eA;logn/(2V10) + 0(log log n)/c. 

As the number of choices of the initial data that defines £ is 0(n^^) and z < 4fcL^^~^", this completes 
the proof. □ 

7.3 Proof of Theorem [TTl 



This proof is very similar to that of Theorem 1.2, but much simpler. The lower bound on degrees in 



G follows from Theorem 2.2, so it remains to show the upper bound. Let e' = be. We take a union 



bound over every vertex x, potential neighborhood A, and set C such that 

\C\ =e'^n\og n 

of the event that 

1. A is the neighborhood of x in G{tmax), 

2. AU C spans no edge in G{tmax), and 
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3. vx is open in G{tmax) for all v £ C. 

We view C as vertices that might be added to the neighborhood of v between time tmax and the end 
of the process. We show that whp there is no triple {x, A, C) with these properties. 
Let X, A, C with |^| = d! = d{l it be fixed, where we write 

d = Itmaxn^^"^ = \J (1/2 - e)nlogn. 

We also specify an appearance time ij for every edge xvj where A = {vi, . . . , u^/}. Let F be the event 
^ U C is an independent set in G{tmax), all pairs joining x and C are open in G{tmax) and all the 
specified edges appear at the specified steps of the process. In order to estimate the probability of 
the event for each step i we need to estimate the probability that the selected edge is compatible 
with this event, conditional on the history of the process. We say i is a selection step if i is one 
of ij for j G [d']; then the selected edge is specified by so the required probability is simply 
= (1 =t 2eQ)2q~^. For other i, the required probability is 1 — Ni/Q, where Ni is the number of 
ordered open pairs that cannot be selected at step i when F occurs. \ii = ij is a selection step write 
Ni = 0. Then we estimate 

d' m 

< ± 2eQ)2g(t,,-)"' • Ui^ " Ni/Q). 
j=i i=i 

We write Ni = NiA + Nic, where NiA counts the ordered open pairs within A and Nic counts those 



in Au C with at least one vertex in C. Following the proof of Theorem 1.2 we have 

-logP(J-) >Sa-Ta + Sc + (21ogn - log2)d' - 0{n^/^), 

where = YZi ^i^Q'^ for € {A, C} and T4 = Zf=i ^t]. 

Following the argument in the previous section for estimating Sai — Ta we have 

SA-TA = d{n^'^-''''). 

In order to estimate Sc we make the following crucial observation. If some vertex u has degree at 
least L^n^ > 2y{tmax) in C then at time t^ax we have Y^u > 2y. We can assume xu is a non-edge as 
X is open to C, so this contradicts our estimate on y-variables. Thus no such vertex u exists. While 



qk > L^^vf , which holds up to time (1 + o{l))tmax, we can apply Lemmas 7.1 and 7.2 to obtain 
> (1 - L-^)q\C? and Qac > (1 - L'^)q\A\\C\. This gives 



Sc>{l- o{l)){\C\' + 2\A\.\C\)\J(^--ey^ = {l- o(l)) f 1 - 2. + e' ^j\^e\ \C\\\o^ 



n. 



Now, taking the union over all possible choices of the data that specifies an event J-, namely the 
choices of v, d' , A, C and the collection of times at which the edges joining v to A appear, we see 
that the probability that a triple (x. A, C) exists with the given conditions is at most 



, , , , . .. , d' 



^ \d' 

d' 



;) (|2|) (;^) exp I -(1 - o(l)) il-2e + e' ^j\^e\ \C\ \\ogn + 0{n'/') 
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where the sum is taken over d! in the interval (1 it n~^^^)d. Since e' = 5e, assuming e < 1/4, this 
probability is at most 

n exp |— e|C|g logn} . 

d' 

Thus the required bound on degrees holds with high probability. □ 

8 Concluding remarks 

We have determined R{3,t) to within a factor of 4 + o(l), so we should perhaps hazard a guess 
for its asymptotics: we are tempted to believe the construction rather than the bound, i.e. that 
R{3,t) ~ t^/41ogt. We should note that we only have an upper bound on the independence number 
of the graph G produced by the triangle-free process. So, formally speaking, the triangle-free process 
could produce a graph that gives a better lower bound on R{3,t). But we believe that this is not 
the case; that is, we conjecture that the bound on the independence number in Theorem 1.2 is 
asymptotically best possible. 

Another natural direction for future research is to provide an asymptotically optimal analysis in 
greater generality for the H-iiee process. No doubt the technical challenges will be formidable, given 
the difficulties that arise in the case of triangles. But on an optimistic note, it is encouraging that 
one can build on two different proofs of this case. 
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